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HIGH FIDELITY DNA POLYMERASE COMPOSITIONS AND USES THEREFOR 

FIELD OF THE INVENTION 

The present invention is related to the field of high fidelity polynucleotide synthesis. 

BACKGROUND OF THE INVENTION 

DNA polymerases catalyze the synthesis of DNA and can be found in all cells as well as 
being encoded in numerous viruses. Although all DNA polymerases possess S'-S' DNA 
polymerization activity, DNA polymerases differ from one another in numerous other properties. 
For example, some enzymatic activities that are possessed by some DNA polymerases, but 
absent in other DNA polymerases include: double stranded DNA 5*-3* exonuclease activity, 
single-stranded DNA 3 '-5' exonuclease activity, double-stranded 3*-5' DNA exonuclease activity, 
RNase H activity, reverse transcriptase activity, and the Uke. Additionally, different DNA 
polymerases may have different optimal pH and temperature ranges for activity. Furthermore, 
DNA polymerases may differ in the rate in which tliey catalyze DNA synthesis. 

Purified DNA polymerases have numerous uses in vitro. A detailed description of DNA 
polymerases, including methods for their isolation, can be fotmd among other places, in DNA 
Replication 2nd edition, by Komberg and Baker, W. H. Freeman & Company, New York, N.Y. 
1991. In vitro uses of DNA pol)anerases include, for example, the labeling and synthesis of 
hybridization probes, DNA sequencing, and DNA amplification. A DNA ampUfication method 
employing DNA poljmierases that has been particularly usefiil is the polymerase chain reaction 
(PGR) technique which employs the use of a thermostable DNA polymerase. 

The first thermostable DNA polymerase that is widely used for DNA amplification is Taq 
DNA poljroerase isolated from the thermostable, aerobic bacterium Thermus aquations, Taq 
DNA polymerase's enzymatic activity at high temperatures allows for primer extension and 
sequencing'Of polynucleotide templates with complex secondary structures (i.e., by PGR 
ampUfication). However, Taq DNA polymerase has significant error rate when incorporating 
nucleotides due to the lack of 3 '-5' exonuclease activity (i.e., proofireading activity), and 
therefore may not be suitable if the amplified sequence is to be used in further gene 
structural/fiinctional studies or cloning. 

In the last 10 years, numerous studies have quantified the error rate of thermostable DNA 
polymerases, and iseveral enzymes have been found to copy DNA more accurately than Taq 
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DNA polymerase (referred to as high fidelity DNA polymerases). U.S. Patent describing DNA 
polymerases include Nos. 4,492,130; 4,946,786; 5,210,036; 5,420,029; 5,489,523; 5,506,137; 
5,545,552; 5,618,711; 5,624,833; 6,238,905; 6,100,078; 6,077,664; 5,968,799; 5,948,663; 
5,885,713; 5,834,285; 5,756,334; 5,747,298; 5,744,312; 5,624,833; 5,602,011; 5,556,772. 

5 High fidelity polymerases alone should definitely increase fidelity rates but usually do 

not amplify long fiagments as efficient as a DNA polymerase lacking a 3'-5' exonuclease 
activity (e.g., Taq DNA polymerase). Enzyme mixtures that combine a standard polymerase 
with a small amount of proofreading polymerase may provide a balance between fidelity and 
yield. A study published in 1994 illustrated that the use of a liigh level of a DNA polymerase 

10 lacking 3 '-5' exonuclease activity (an exo" DNA polymerase, Klentaq-1) with a very low level of 
a thermostable DNA polymerase exhibiting 3 '-5' exonuclease activity (an exo*** DNA polymerase 
such as Pfii, Vent, or Deep Vent) generated products with increased base-pair fidelity with a 
maximum yield of 35 kb DNA from 1 ng of lambda DNA template (Barnes, Proceedings of the 
National Academy of Sciences, 91 :2216-20, 1994). Similarly, U.S. Patent Nos. 5,436,149 and 

15 6,008,205 disclosed metiiods for improving DNA amplification fidelity usiiig a DNA polymerase 
composition comprising a first enzyme substantially lacking 3 '-5' exonuclease activity and a 
second en2yme comprising 3'-5' exonuclease activity. In mixtures such as these, the exo**" 
enzyme acts to correct polymerization errors produced by the exo" DNA polymerase. 

The problem inherited in the above composition comprising the mix of two DNA 
20 polymerases is that the high polymerization activity resulted from combining the two DNA 
polymerases may inhibit the efficiency and flierefore the yield of the amplification reaction. 
Therefore, one can not increase fidelity by increasing the proportion of the proofreading DNA 
polymerase without compromising PGR product yield. It is also known that the amplification 
fidelity may also be affected by higli DNA polymerase concentration (see for example, Mattila et 
25 al., 1991, Polynucleotides Research, 19:4967-73). 

There is therefore a need in the art for new methods and compositions which improve 
polymerization fidelity and reduce the side effects resulted from having high polymerization 
activity in the reaction, 

SUMMARY OF THE BNTVENTION 

30 Tlie present invention provides an enzyme mixture comprising a first enzyme and a 

second enzyme, where the first enzyme comprises a DNA polymerization activity, and the 

- 2 • 
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second eiizj'me comprises a 3 '-5' exonuclease activity and a reduced DNA polymerization 
activity. . 

Hie present invention also provides an enzyme mixture comprising a first enzyme and a 
second enzyme, where the first enzyme is a wild type Pfu DNA polymerase, the second enzyme 
is a mutant Pfii DNA polymerase comprising a 3 '-5* exonuclease activity and a reduced DNA 
polymerization activity. 

The present invention further provides an enzyme mixture comprising a first enzyme and 
a second enzyme, where the first enzyme is a Taq DNA polymerase, the second enzyme is a 
mutant Pfii DNA polymerase comprising a 3 '-5' exonuclease activity and a reduced DNA 
polymerization activity. 

The present invention also provides an enzyme mixture comprising a first enzyme and a 
second enzyme, where tlie first enzyme comprises a DNA polymerization activity and is a wild- 
type Pfu DNA polymerase or a wild-type Taq DNA polymerase, and the second enzyme is a 
mutant Pfu DNA polymerase comprising a 3 '-5' exonuclease activity and a reduced DNA 
polymerization activity. 

The present invention provides an enzyme mixture comprising two or more enzymes, 
where at least a first enzjone in the enzyme mixture comprises a DNA polymerization activity, 
and at least a second enzyme in the enzyme mixture comprises a 3 '-5' exonuclease activity and a 
reduced DNA polymerization activity. 

The present invention further provides a mutant Pfu DNA polymerase with reduced DNA 
polymerization activity, where the mutant Pfu DNA polymerase comprises one or more 
mutations at amino acid positions selected firom the group consisting of: T542, D543, K593, 
Y595, Y385, G387, and G388. 

The present invention still provides a composition comprising a mutant Pfu DNA 
polymerase, where the mutant DNA polymerase comprises one or more mutations at amino acid 
positions selected fi-om the group consisting of: T542, D543, K593, Y595, Y385, G387, and 
G388. 

The present invention provides a mutant Pfu DNA polymerase produced by introducing a 
mutation in to a polynucleotide encoding a wild type Pfu DNA polymerase to produce a mutant 
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Pfu DNA polymerase comprising one or more mutations at amino acid positions selected from 
the group consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

The present invention also provides a mutant Pfu DNA polymerase comprising a reduced 
DNA polymerization activity, where the mutant Pfii DNA polymerase is produced by the steps: 

5 (a) providing a polynucleotide encoding a wild-type Pfu DNA polymerase; 

(b) introducing one or more nucleotide mutations into the polynucleotide to produce a mutant 
polynucleotide encoding the mutant Pfu DNA polymerase; and 

(c) expressing the mutant polynucleotide to produce the mutant Pfii DNA polymerase, where the 
mutant Pfu DNA polymerase comprises one or more mutations at amino acid positions selected 

10 from the group consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

The present invention provides a composition comprising a mutant Pfu DNA polymerase 
produced by expressing a polynucleotide encoding a Pfu DNA polymerase Avith a reduced DNA 
polymerization activity, where the mutant Pfii DNA polymerase comprises one or more 
mutations at amino acid positions selected from the group consisting of: T542, D543, K593, 

15 Y595, Y3S5, G387, and G388. 

The present invention also provides a composition comprising a mutant Pfu DNA 
polymCTase comprising a reduced DNA polymerization activity, where the mutant Pfu DNA 
polymerase is produced by the steps: (a) introducing a mutation into a polynucleotide encoding a 
wildrtype Pfu DNA polymerase to produce a mutant polynucleotide encoding the mutant Pfii 

20 DNA polymerase comprising one or more mutations at amino acid positions selected from the 
group consisting of: T542, D543, K593, Y595, Y385; G387, and G388; (b) expressing the 
mutant polynucleotide to produce the composition comprising the mutant Pfu DNA polymerase. 

The present invention further provides a kit coinprising a first enzyme and a second 
enzyme, where the first enzyme comprises a DNA polymerization activity, the second enzyme 
25 comprises a 3'-5' exonuclease activity and a reduced DNA polymerization activity, and 
packaging material therefore. 

The present invention also provides a kit comprismg a first enzyme and a second enzyme, 
and packaging material therefor, where the first enzyme is a wild type Pfix DNA polymerase, the 
second enzyme is a mutant Pfii DNA polymerase comprising a 3 '-5' exonuclease activity and a 
30 reduced DNA polymerization activity. 



4 



wo 03/060144 



PCT/US02/40423 



The present invention further provides a kit comprising a first enzyme and a second 
enzyme, and packaging material therefore, where tlie first enzyme is a Taq DNA polymerase, 
and packaging material therefor, the second enzjane is a mutant Pfii DNA polymerase 
comprising a 3 '-5' exonuclease activity and a reduced DNA polymerization activity. 

The present invention provides a kit comprising an enzyme mixture which comprises a 
first enzyme and a second enzyme, where the first enzyme comprises a DNA polymerization 
activity and is a wild-type Pfu DNA polymerase or a wild-type Taq DNA polymerase, and the 
second enz5^ne is a mutant Pfu DNA polymerase comprising a 3 '-5' exonuclease activity and a 
reduced DNA polymerization activity, and packaging means therefor. 

The present invention also provides a kit comprising a mutant DNA polymerase which 
comprises a reduced DNA polymerization activity and packaging material therefor, where the 
mutant Pfu DNA polymerase comprises one or more mutations at amino acid positions selected 
fi-om the group consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

hi one embodiment, the first enzyme of the present invention is a DNA polymerase or a 
reverse transcriptase. 

Preferably, the DNA polymerase is selected firom the group consisting of: Taq DNA 
polymerase, Tth DNA polymerase, UlTma DNA polymerase, Tli DNA polymerase, Pfu DNA 
polymerase, KOD DNA polymerase, JDF-3 DNA polymerase, PGB-D DNA polymerase and 
DP1/DP2 DNA polymerase. 

In one embodiment of the present invention, the second enzyme is a mutant DNA 
polymerase. 

Preferably, the mutant DNA polymerase is derived firom a DNA polymerase different 
from the first enzyme. 

More preferably, the mutant DNA polymerase is derived from a DNA polymerase 
selected from the group consisting of: UlTma DNA polymerase, Tli DNA polymerase, Pfu DNA 
polymerase, KOD DNA polymerase, JDF-3 DNA polymerase, PGB-D DNA polymerase and 
DP1/DP2 DNA polymerase. 

Preferably, the mutant DNA polymerase comprises a mutation in its partitioning domain 
or the polymerase domain. 
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More preferably, the mutant Pfii DNA polymerase comprises one or more mutations at 
amino acid positions selected from the group consisting of: D405, Y410, T542, D543, K^93, 
Y595, Y385, G387, and G3S8. 

More preferably, the mutant Pfu DNA polymerase comprises one or more mutations 
5 selected from the group consisting of: D405E, Y410F, T542P, D543G. K593T, Y595S, Y3S5Q, 
Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

In a preferred embodiment of the present invention, the mutant Pfii DNA polymerase 
comprises a mutation of G387P. 

The'enzyme mixture, composition, or kit of the present invention may further comprises 
10 a PGR enliancing factor and/or an additive. 

Preferably, the enzyme mixture, composition, or kit comprising an enzyme mixture 
comprises a ratio of polymeriziation activity/exonuclease activity of (2.5-5U)/(0.02-5U). 

More preferably, the enzyme mixture, composition, or kit comprising an enzyme mixture 
comprises a ratio of polymerization activity/exonuclease activity of (2.5U)/(0.04-0.08U). 

1 5 In the enzyme mixture of the present invention, the first enzyme may be an enzyme of an 

enzyme blend, where the enzyme mixture is produced by mixing the enzyme blend with the 
second enzyme. 

Preferably, the enzyme blend comprises a wild-type Pfti DNA polymerase and a wild- 
type Taq DNA polymerase. 

20 Also preferably, the enzyme blend may fiirther comprise a PGR enhancing factor. 

The mutant Pfu DNA polymerase of the present invention may comprise one or more 
mutations selected from the group consisting of: T542P, D543G, K593T, Y595S, Y385Q. 
Y385S, Y385N, y385L, Y385H, G387S, G387P, and G388P. 

Preferably, flie mutant Pfii DNA polymerase comprises one or more mutations selected 
25 from the group consisting of: T542P, D543G, K593T, Y595S, Y385Q, Y385S, Y385N, Y385L, 
Y385H, G387S, G387P, and G388P. 
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The present invention provides an isolated polynucleotide comprisinig a nucleotide 
sequence encoding a mutant enzyme comprises a 3 '-5' exonuclease activity and a reduced DNA 
polymerization activity. 

Preferably, the mutant enzyme comprising a 3 '-5' exonuclease activity and a reduced 
5 DNA polymerization activity which is encoded by the isolated polynucleotide of the present 
invention is a mutant DNA polymerase or a mutant reverse transcriptase. 

More preferably, the isolated polynucleotide encodes a mutant Pfu DNA polymerase. 

More preferably, the isolated polynucleotide encodes a mutant Pfii DNA polymerase 
comprises one or more mutations at amino acid positions selected from the group consisting of: 
10 T542, D543, K593, Y595, Y385, G387, and G388. 

More preferably, the isolated polynucleotide encodes a mutant Pfii DNA polymerase 
comprises one or more mutations selected from the group consisting of: Y410F, T542P, D543G, 
K593T, Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

The present invention provides a pair of isolated polynucleotides, where a first 
15 polynucleotide of the pair comprises a polynucleotide sequence encoding a first enzyme 

comprising a DNA polymerase activity, and a second polynucleotide of the pair comprises a 
polynucleotide sequence encoding an enzyme comprising a 3 '-5' exonuclease activity and a 
reduced DNA polymerization activity. 

The present invention also provides a pair of isolated polynucleotides, where a first 
20 polynucleotide of the pair comprises a polynucleotide sequence encoding a wild-type Pfii DNA 
polymerase or a Taq DNA polymerase, and a second polynucleotide of the pair comprises a 
polynucleotide sequence encoding an mutant Pfii DNA polymerase comprising a 3 '-5' 
exonuclease activity and a reduced DNA polymerization activity. 

Preferably, the second pol3aiucleotide of the pair comprises a polynucleotide sequence 
25 encoding a mutant Pfii DNA polymerase which comprises one or more mutations at amino acid 
positions selected from the group consistmg of: D405, Y410, T542, D543, K593, Y595, Y385, 
G387, and G388, 

Also preferably, the second polynucleotide of the pair comprises a polynucleotide 
sequence encoding a mutant Pfu DNA polymerase which comprises one or more mutations 
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selected from the group consisting of: D405E, Y410F, T542P, D543G, K593T, Y595S, Y385Q, 

Y385S, Y3S5N, Y385L, Y385H, G387S, G387P, and G388P, 

The present invention provides a method for DNA synthesis comprising: (a) 
providmg an enzyme mixtme of the present invention, the enzyme mixture comprising a 
5 first enzyme comprising a DNA polymerization activity, and a second enzyme comprising a 3'- 

5* exonuclease activity and a reduced DNA polymerization activity; and (b) contactmg the 

enzyme mixture with a nucleic acid template, where the enzyme mixture permits DNA syntliesis. 

Preferably, the nucleic acid template is a DNA or an RNA molecule, 

Tlie present invention provides a method for DNA synttxesis comprising: (a) 
1 0 providing an enzyme mixture of the present invention, the enzyme mixture comprising a 

wild type Pfo DNA polymerase as a first enzyme, and a mutant Pfu DNA polymerase as a 
second enzyme which comprises a 3 '-5' exonuclease activity and a reduced DNA polymerization 
activity; and (b) contacting the enzyme mixture with a nucleic acid template, where the 
enzyme mixture permits DNA synthesis. 

1 5 The present invention also provides a method for TA cloning of DNA synthesis product 

comprising: (a) providing an enzyme mixture of the present invention, the enzyme mixture 

comprising a Taq DNA polymerase as a first enzyme, and a mutant Pfu DNA polymerase as a 
second enzyme which comprises a 3 '-5' exonuclease activity and a reduced DNA polymerization 
activity; (b) contacting the enzyme mixture with a nucleic acid template, where the enzyme 

20 mixture permits DNA synthesis to generate a synthesized DNA product; and (c) inserting the 
synthesized DNA product into a TA cloning vector, 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 is a figure showing PGR proofreading activity assay using Pfii DNA polymerase 
mutants according to some embodiments of the invention. 

25 Figure 2 is a figure showing PGR performance of Pfii plus Pfu G387P mutant blends 

according to some embodiments of the invention. 

Figure 3 is a figure showmg PGR performance of Taq plus Pfii G387P mutant blends 
according to some embodiments of the invention. 
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Figure 4 is a figure showing PGR accuracy of PfuTurbo with different amount of 
PfuG387P according to some embodiments of the invention. 

Figure 5 is a figure showing PGR accuracy of PfuTurbo plus PfuG387P according to 
some embodiments of the invention. 

5 Figure 6 is a figure showing the error rate of Taq plus PfuG387P according to some 

embodiments of the invention. 

Figure 7 is a figure showing the polypeptide and polynucleotide sequences of wild-type 
DNA polymerases and mutant DNA polymerases according to some embodiments of the 
invention. 

1 0 DETAILED DESCRIPTION OF THE INVENTION 

The subject invention provides novel composition for hig^i fidelity polynucleotide 
synthesis, particularly DNA synthesis. The subject compositions comprise an enzyme mixture 
for DNA synthesis comprising a fiirst enzyme and a second enzyme, where the first enzyme 
comprises a DNA polymerization activity, and the second enzyme comprises a 3'-5' exonuclease 
1 5 activity and a reduced DNA polymerization activity. In addition to providing high fidelity for 
DNA synthesis, the compositions of the subject invention prevent side effects of a high 
polymerization activity, therefore, increase the efficiency of the amplification compared to a 
mixture in which both DNA polymerases possess wild-type polymerization activities. 

Definitions 

20 As used herein, "synthesis" refers to any in vitro method for malcing new sti-and of 

polynucleotide or elongating existing polynucleotide (i.e., DNA or RNA). Synthesis, according 
to the invention, include ampUfication, which increases the number of copies of a polynucleotide 
template sequence with the use of a polymerase. Polynucleotide synthesis (e.g., ampUfication) 
results in the incoiporation of nucleotides into a polynucleotide (i.e., a primer), thereby forming a 

25 new polynucleotide molecule complementary to the polynucleotide template. The formed 
polynucleotide molecule and its tenq^late can be used as templates to synthesize additional 
polynucleotide molecules. 
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"DNA synthesis", according to the invention, includes, but are not limited to PGR, 
reverse transcription, tlie labelling of polynucleotide (i.e., for probes and oligonucleotide 
primers), polynucleotide sequencing. 

As used herein, the term "template dependent manner" is intended to refer to a process 
5 ttiat involves the template dependent extension of a primer molecule (e.g., DNA synthesis by 
DNA polymerase). The term "template dependent manner" refers to polynucleotide synthesis of 
KNA or DNA wherein the sequence of the newly synthesized strand of polynucleotide is dictated 
by tite well-known rules of complementary base pairing (see, for example, Watson, J. D. et al.. 
In: Molectdar Biology of the Gene. 4th Ed., W. A. Benjamin, Lie, Menio Park, Calif. (1987)). 

-40 As used herein, "polynucleotide polymerase" refers to an enzyme that catalyzes the 

polymerization of nucleotide (i.e., the polymerase activity). Generally, the enzyme will initiate 
synthesis at the 3'-end of the primer annealed to a polynucleotide template sequence, and will 
proceed toward tiie 5' end of the template strand. "DNA polymerase" catalyzes the 
polymerization of deoxynucleotides. 

15 As used herein, the "polymerase domain" refers to the one or more domains of a DNA 

polymerase which is critical for its polymerization activity. The position of the polymerase 
domain varies, for example, the polymerase domain for Pfu, Tgo, KDO, Tli (V ent) and PGB-D 
(dee Vent) are located at amino acid positions as described in Table 2B. 

As used herein, the "partitioning domain" refers to a donaain of a DNA polymerase which 
20 plays a critical role in coordinating the balance between synthesis and degradation of the DNA 
chain. Generally the partitioning domain is characterized by the YXGG motif (Truniger et al., 
1996, EMBO J. 15:3430-3441). This region is located within an accessible loop connecting the 
3 '-5' exonuclease and polymerase domains. The position of the partitioning domain varies. For 
example, the partitioning domain for Pfti, Tgo, KDO, Tli (Vent) and PGB-D (dee Vent) are 
25 located at amino acid positions 384-389, 383-388, 383-388, 386-391, and 384-389 repectively. 

Accordmg to the invention, another class of DNA polymerase is "reverse transcriptases", 
also referred to as "RP', is a critical enzyme responsible for the synthesis of cDNA from vural 
RNA for all retroviruses, including HIV, HTLV-I, HTLV-H, FeLV, FIV, SIV, AMV, MMTV, 
and MoMuLV. For review, see e.g. Levin, 1997, CeU, 88:5-8; Brosius et al., 1995, Virus Genes 
30 1 1:163-79. The term "reverse transcriptase (RT) activity" means the ability to synthesize cDNA 
from RNA template. Methods for measuring RT activity are well known in the art, for example, 

•10 
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the Quan-T-RT assay system is conunercially available from Amersham (Arlington Heights, 111.) 
and is described in Bosworth, et al., Nature 1989, 341:167-168. 

As used herein, a mutant DNA polymerase witli "reduced polymerization activity" is a 
DNA polymerase mutant comprising a DNA polymerization activity which is lower than that of 
the wild-type enzyme, e.g., comprising less than 10% DNA (e.g., less than 8%, 6%, 4%, 2% or 
less than 1 %) polymerization activity of that of the wild-type enzyme. 

As used herein, "exonuclease*' refers to an enzyme that cleaves bonds, preferably 
phosphodiester bonds, between nucleotides one at a time from the end of a DNA molecule. An 
exonuclease can be specific for the 5* or 3* end of a DNA molecule, and is referred to herein as a 
5' to 3' exonuclease or a 3* to 5' exonuclease. A useful exonulcease according to the invention is 
a 3' to 5' exonuclease which degrades DNA by cleaving successive nucleotides from the 3' end 
of the polynucleotide. During the synthesis or amplification of a polynucleotide template, a 
DNA polymerase with 3 ' to 5' exonuclease activity (exo"*^ has the capacity of removing 
mispaired base (proofreading activity), therefore is less error-prone than a DNA polymerase 
without 3' to 5* exonuclease activity (exo ). The exonuclease activity can be defined by methods 
well known in the art. For example, one unit of exonuclease activity may refer to the amount of 
enzyme required to cleave 1 ^ig DNA target in an hour at 37°C. Wild type Tth DNA polymerase 
and Taq DNA polymerase are "exo"" because they do not have 3' to 5' exonuclease activities, 
however, wild type Pfu DNA polymerase, E. coli DNA polymerase I, T7 DNA polymerase, Tma 
DNA polymerase, TU DNA polymerase, KOD DNA polymerase, JDF DNA polymerse, and 
PGB-D DNA polymerase are "exo"*^" because tliey all exhibit 3 ' to 5' exonuclease activity. 

The term "fidelity" as used herein refers to the accuracy of DNA polymerization by 
template-dependent DNA polymerase. The fideUty of a DNA polymerase is measured by the 
error rate (the frequency of incorporating an inaccurate nucleotide, i.e., a nucleotide that is not 
incorporated at a template-dependent manner). The accuracy or fidelity of DNA polymerization 
is maintained by both the polymerase activity and the 3 '-5' exonuclease activity of a DNA 
polymerase. The term "high fideUty" refers to an error rate of 5 x 10"^ per base pair or lower. 
The fidehty or error rate of a DNA polymerase may be measured using assays known to the art 
(see for example, Lundbiurg et al., 1991 Gene, 108:1-6). 

As used herein, an "amplified product" refers to the double strand polynucleotide 
population at the end of a PGR amphfication reaction. The amplified product contains the 
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original polynucleotide template and polynucleotide synthesized by DNA polymerase using the 
polynucleotide template during the PGR reaction, 

As used herein, "polynucleotide template" or "target polynucleotide template" refers to a 
polynucleotide containing an amplified region. The "amplified region," as used herem, is a 
5 region of a polynucleotide that is to be either synthesized by reverse transcription or amplified by 
polymerase chain reaction (PGR). For example, an amplified region of a polynucleotide 
template resides between two sequences to which two PGR primers are complementary to. 

As used herein, the term "primer" refers to a single stranded DNA or RNA molecule that 
can hybridize to a polynucleotide template and prime enzymatic synthesis of a second 
10 polynucleotide strand. A primer useful according to the invention is between 10 to 100 
nucleotides in length, preferably 17-50 nucleotides in length and more preferably 17-45 
nucleotides in length. 

"Gomplementary" refers to the broad concept of sequence complementarity between 
regions of two polynucleotide strands or between two nucleotides through base-pairing. It is 
15 known that an adenine nucleotide is capable of forming specific hydrogen bonds ("base pairing") 
with a nucleotide which is thymine or uracil. Similarly, it is known that a cytosine nucleotide is 
capable of base pairing with a guanine nucleotide. 

The term "vrild-type" refers to a gene or gene product which has the characteristics of 
that gene or gene product when isolated fi^om a naturally occurring source. In contrast, the term 
20 "modified" or "mutant" refers to a gene or gene product which displays altered characteristics 
when compared to the wild-type gene or gene product. For example, a mutant DNA polymerase 
in the present invention is a DNA polymerase which exhibit a reduced DNA polymerization 
activity. 

As used herein, an "enzyme mixture" according to the invention, comprises a first 
25 enzyme comprismg DNA polymerization activity and a second enzyme comprising a 3 '-5 ' 
exonuclease activity and a reduced DNA polymerization activity. The ratio of the DNA 
polymerase activity and the exonuclease activity in the enzyme mixture is about (2,5-5U of DNA 
polymerization activity)/(0.05- 1 OU of 3 '-5 ' exonulcease activity). 
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As used herein, the term "enzyme blend" refers to an enzyme composition comprising 
two or more premixed enzymes. The "enzyme blend" may further comprise other reagents, such 
as PGR enhancing factor, enzyme storage buffer, or reaction buffer. 

Useful DNA Polymerases And Reverse Transcriptases 

DNA polymerases and their properties are described in detail in, among other places, 
DNA Replication 2nd edition, Komberg and Baker, W. H. Freeman, New York, N.Y. (1991). 

Known conventional DNA polymerases include, for example, Pyrococcus furiosus (Pfli) 
DNA polymerase (Lxmdberg et al., 1991, Gene, 108:1, provided by Stratagene), Pyrococcus 
woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20:186-8, provided by 
BoehringerMarmheim), Tlterimis thennophiltis (Tth) DNA polymerase (Myers and Gelfand 
1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and 
McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (Tli) DNA polymerase 
(also referred to as Vent DNA polymerase, Cariello et al., 1991, Polynucleotides Res ^ 19: 4193, 
provided by New England Biolabs), 9°Nm DNA polymerase (discontinued product from New 
England Biolabs), Tlxermotoga maritima (Tma) DNA polymerase (Diaz and Sabino, 1998 Braz 
J, Med, Res, 31:1239), Thennus aquaticus (Taq) DNA polymerase (Chien et al., 1976, J, 
Bacteoriol, 127: 1550), Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, 
AppL Environ, MicrobioL 63:4504), JDF-3 DNA polymerase (from thermococcus sp. JDF-3, 
Patent application WO 0132887), Pyrococcus GB-D (PGB-D) DNA polymerase (also referred as 
Deep- Vent DNA polymerase, Jimcosa-Ginesta et al., 1994, Biotechniques, 16:820, provided by 
New England Biolabs), UlTma DNA polymerase (from thennophile Thermotoga maritima; Diaz 
and Sabino, 1998 Braz J. Med, Res, 31:1239; provided by PE Applied Biosystems), Tgo DNA 
polymerase (from thermococcus gorgonarius, provided by Roche Molecular Biochemicals), 
coli DNA polymerase I (Lecomte and Doubleday, 1983, Polynucleotides Res. 11:7505), T7 DNA 
polymerase (Nordstrom et aL, 1981, J. BioL Chem. 256:3112), and archaeal DP1/DP2 DNA 
polymerase H (Cann et al., 1998, Proc Natl Acad Sci U S A 95:14250-5). The polymerization 
activity of any of the above enzymes can be defined by means well known in the art. One imit of 
DNA polymerization activity of conventional DNA polymerase, according to the subject 
invention, is defined as the amoimt of enzyme which catalyzes the incorporation of 10 nunoles of 
total deoxyaucleotides (dNTPs) into polymeric form in 30 minutes at optimal tOTiperature (e.g., 
72*'C for Pfii DNA polymerase). Assays for DNA polymerase activity and 3'-5' exonuclease 
activity can be found in DNA Replication 2nd Ed .. Komberg and Baker, supra; Enzvmes , Dixon 
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and Webb, Academic Press, San Diego, Calif. (1979), as well as other publications available to 
the person of ordinary skill in the art. 

When using the subject compositions in reaction mixtures that are exposed to elevated 
temperatures, e.g., during the PGR technique, use of thermostable DNA polymerases is 
5 preferred. 

Reverse transcriptases useftil according to the invention include, but are not limited to, 
reverse transcriptases from HIV, HTLV-1, HTLV-E, FeLV, FIV, SIV, AMV, MMTV, 
MoMuLV and other retroviruses (for reviews, see for example. Levin, 1997, Cell, 88:5-8; 
Verma, 1977, Biochim Biophys Acta. 473:1-38; Wu et al., 1975, CRC Crit Rev Biochem. 3:289- 
10 . 347). 

Useful First Enzvme Comprising DNA Polymerization Activity 

Enzymes comprising DNA polymerization activity according to the present invention 
include eixzymes such as DNA polymerases and reverse transcriptases. 

The first enzyme used in tfie subject composition can be any DNA polymerase, with or 
15 without a proofreading activity. Preferably, a wild type DNA polymerase is used. However, a 
mutant DNA polymerase can also be used so long as it provides sufficient DNA polymerization 
activity required for an amplification reaction. 

In a preferred embodiment, the first enzyme comprising DNA polymerization activity is a 
wild type Pfu DNA polymerase. The enzyme mixture comprising a Pfii DNA polymerase as the 
20 first enzyme is also referred to as a Pfii blend herein after. 

In preferred embodiments of the invention, a Pfu blend enzyme mixture is used for DNA 
synthesis reaction, e.g., PCR reaction. 

In another preferred embodiment, the first enzyme comprising DNA polymerization 
activity is a wild type Taq DNA polymerase. The eazyme mixture comprising a Taq DNA 
25 polymerase as tire first enzyme is also referred to as a Taq blend herein after. 

In preferred embodiments of flie invention, a Taq blend enzyme mixture is used for DNA 
synthesis reaction and for subsequent direct cloning, e.g., PCR reaction followed by TA cloning. 
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111 one embodiment, tlie first enzyme exists in the fomi of an enzyme blend. This anzyme 
blend is mixed with a second enzyme comprising a reduced polymerization activity to produce 
an enzyme mistui e of the rpesent invention. 

hi a preferred embodiment, the enzyme blend is a Herculase® Enhanced or a Herculase® 
Hotstart DNA polymerase (Stratagene, Cat. No. 6003 10 or 600260). The enzyme blend can also 
be selected from commericially available enzyme blend, for example, from the group consisting 
of: EXL DNA Polymerase (Stratagene, Cat, No. 6003420/2/4), YieldAce DNA Polymerase 
(Stratagene, Cat. No. 600290/2/4), TaqPlus Precision PCR System (Stratagene, Cat. No. 
600210/1/2), TaqPlus Long lOOU (Stratagene, Cat, No, 600203/4/5), Advantage 2 PCR Enzyme 
System (BD Biosciences-Clontech, Cat No. 8430-1), Advantage-GC 2 (BD Biosciences- 
Clontech, Cat No, 8433-1), Advantage-HF 2 (BD Biosciences-Clontech, Cat No. K1914-y/l), 
BIO-X-ACT DNA Polymerase ^iohne. Cat. No. BIO-2 1049/50), TripleMaster PCR System 
(Brinkmann, Cat. No. 0032-008-216/24/32), FailSafe PCR System (Epicentre, Cat. No. 
FS99060/100/250/1K), MasterAmp Extra-Long PCR Kit (Epicentre, Cat. No, 
MHF9220/QU92125/QU92500QU9201K), Synergy DNA Polymerase (GeneCraft, Cat No. GC- 
005), SynergyN DNA Polymerase(GeneCraft, Cat No. GC-028), SynergyPlus DNA Polymerase 
(GeneCraft, Cat No. GC-048), Takara ExTaq DNA Polymerase (Intergen, Cat. No. 
RROOl A/B/C), PCR SuperMix High Fidelity (Invitrogen, Cat. No. 10790020), Elongase Enzyme 
Mix (Invitrogen, Cat. No. 10481018), Takara ExTaq DNA Polymerase (PanVera, Cat. No. TAK 
RROOl A/B/C), Takara LATaq DNA polymerase (PanVera, Cat No. TAK RR002MZB/C), 
Expand Hi^ Fidelity PCR System (Roche Molecular Biochemicals, Cat. No. 1 732 
641/650/078), Expand Long Template PCR System (Roche Molecular Biochemicals, Cat. No. 1 
681 834/842; 1 7659 060), Expand 20 kb PLUS PCR System (Roche Molecular Biochemicals, 
Cat. No, 1 811 002), GC-RICHPCR System (Roche Molecular Biochemicals, Cat. No. .2 140 
306), AccuTaq LA DNA Polymerase (Sigma-Aldrich, Cat No. D8045), KlenTaq LA DNA 
Polymerase mix (Sigma-Aldrich, Cat. No. D5062), ProoSprinter DNA Polymerase Mix 
(Thermo Hybaid, Cat. No. PROOFMDClOO/300/600) and ProofExpander PCR Kit (Theraio 
Hybaid, Cat. No. EXPANDIOO). 

The enzyme mixture of the present invention may comprise three DNA polymerases 
comprising a first, a second and a third DNA polymerases. The first DNA polymerase is a DNA 
polymerase with wild-type DNA polymerization activity, e.g., a wild-type Taq DNA polymerase 
or a wild-type Archaeal DNA polymerase. The second and the third DNA polymerases are 
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mutant DNA polymerases comprising a 3'-5' exonuclease activity and a reduced DNA 
polymerization activity. Preferably, the second and the third DNA polymerases are different 
DNA polymerases. More preferably, the second and the third DNA polymerases are different 
DNA polymerases selected from the group consisting of: mutant Pfu DNA polymerase, mutant 
5 Tgo DNA polymerase, mutant KOD DNA polymerase, mutant Vent DNA polymerase, and 
mutant Deep Vent DNA polymerase. More preferably, the mutant Pfu DNA polymerase 
comprises one or more mutations at amino acid positions selected from the group consisting of: 
D405, Y410, T542, D543, K593, Y595, Y385, G387, and G388; the mutant Tgo DNA 
polymerase comprises one or more mutations at amino acid positions selected from tlie group 

10 consisting of: D404, Y409, T541, D542, K592, Y594, Y3S4, G386, and G387; the mutant KOD 
DNA polymerase comprises one or more mutations at amino acid positions selected from the 
group consisting of: D404, Y409, T541, D542, K:592, Y594, Y384, G386, and G387; the mutant 
Vent DNA polymerase comprises one or more mutations at amino acid positions selected from 
the group consisting of: D407, Y412, T544, D545, K595, Y597, Y387, G389, and G390; the 

15 mutant Deep Vent DNA polymerase comprising one or more mutations at amino acid positions 
selected from the group consisting of: D405, Y410, T542, D543, K593, Y595, Y385, G387, and 
G388. The two mutant DNA polymerases comprising a 3 '-5' exonuclease activity and a reduced 
DNA polymerization activity are preferably having different 3 '-5' exonuclease specificity. For 
example, they may be a mutant JDF-3 DNA polymerase (e.g., G387) and a mutant Pfti DNA 

20 polymerase (e.g., G3 87). The mixture comprising two mutant DNA polymerases as such will 
enhance the proofreading activity of flie mixture because JDF and Pfii have diffarent 
proofreading spectra so that they can complement with each other to achieve better fidelity of the 
amplification reaction. It is understood that ttie combination of the second and the third enzymes 
in the mixture of the present invention is not limited to tiie three enzymes listed in this example. 

25 With the scope of the present invention, any two mutant Archaeal DNA polymerases can be used 
in die same mixture with a first DNA polymerase (Archaeal or non- Archaeal DNA polymerase). 
Some non-limiting examples of such three-enzyme mixtures include: a wild-^e Taq DNA 
polymerase, a JDF-3 mutant and a Pfti mutant; a wild-type Taq DNA polymerase, a KOD mutant 
and a Pfu mutant; a wild-type JDF DNA polymerase, a JDF-3 mutant and a Pfu mutant; a wild- 

30 type Pfii DNA polymerase, a JDF-3 mutant and a Pfii mutant; a wild-type KOD DNA 

polymerase, a JDF-3 mutant and a Pfu mutant; a wild-type KOD DNA polymerase, a KOD-3 
mutant and a Pfu mutant; a wild-type Pfii DNA polymerase, a JDF-3 mutant and a Pfu mutant; a 
wild-type Pfu DNA polymerase, a KOD mutant and a Pfu mutant. 
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Useful Second Eiizvme Comprising 3 '-5' Exonuclease Activity 

Enzyme comprising 3 '-5' exonuclease activity (i.e., proofreading DNA polymerase) 
according to the invention include, but are not limited to, DNA polymerases, E. coli exonuclease 
I, E. coli exonuclease III, E. coli recBCD nuclease, mung bean nuclease, and the like (see for 
example, Kuo, 1994, Aim N Y Acad Sci., 726:223-34). 

Any proofreading DNA polymerase could be mutagenized to reduce/eliminate DNA 
polymerase activity and vised in the enzyme reaction of the present invention. Examples can be 
found in many DNA polymerase families including, but are not limited to such as follows: 

Familv B DNA polymerases 

Bacteriophage T4 DNA polymerase, <})29 DNA polymerase, T7 DNA polymerase; E, coli 
pol n DNA polymerase; human DNA polymerase 5, human DNA polymerase y, archaeal DNA 
polymerase I (Table 1). 

Eubacterial Family A DNA polymerases (with proofreading activity) 

E. coli DNA pol I (Klenow fragment), Thermotoga maritima (UlTma fragment) 

Familv D DNA polymerases (unrelated to Families A, B. O 

Archaeal DNA polymerase n (DP1/DP2) e.g., as described in Cann et al (1998) PNAS 
95:14250-5. 

Table 1. Accession Information for Cloned Family B Polymerases 

Vent Thermococcus litoralis 
ACCESSION AAA72101 
PID g348689 

VERSION AAA72101.1 GI:348689 
DBSOURCE locus THCVDPE accession M74198.1 
THEST THERMOCOCCUS SP. (STRAIN TY) 
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ACCESSION 033845 

PID g39 13524 

VERSION 033845 GI:3913524 

DBSOURCE swissprot: locus DPOL_THEST, accession 033845 
5 Pab Pyrococcus abyssi 

ACCESSION P77916 
PID §3913529 
VERSION P77916 GI:3913529 

DBSOURCE swissprot: locus DPOL_PYRAB, accession P779 1 6 
10 PYRiHO Pyrococcus horikoshii 

ACCESSION 059610 
PID g3913526 
VERSION 059610 GI:3913526 

DBSOURCE swissprot: locus DPOL_PYRHO, accession 059610 
1 5 P YRSE PYROCOCCUS SP. (STRAIN GE23) 

ACCESSION P77932 
PID g3913530 
VERSION P77932 GI:3913530 

DBSOURCE swissprot: locus DPOL_PYRSE, accession P77932 
20 Deep Vent Pyrococcus sp. 

ACCESSION AAA67131 
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PID g436495 

VERSION AAA67131.1 GI:436495 

DBSOURCE locus PSU00707 accession U00707.1 

Pfii Pyrococcus furiosus 

ACCESSION P80061 

PID §399403 

VERSION PS0061 GI:399403 

DBSOURCE swissprot: locus DPOL_PYRFU, accession P80061 

JDF-3 Thermococcus sp. 

Unpublished 

Baross gi|2097756|pat|US|5602011|12 Sequence 12 from patent US 5602011 

9degN THERMOCOCCUS SP. (STRAIN 90N-7). 

ACCESSION Q56366 

PID g3913540 

VERSION Q56366 GI:3913540 

DBSOURCE swissprot: locus DPOL_THES9, accession Q56366 
KOD Pyrococcus sp. 
ACCESSION BAA06142 
PID gl620911 

VERSION BAA06142.1 01:1620911 

DBSOURCE locus PYWKODPOL accession D29671 . 1 
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Tgo Thermococcus gorgonarius. 

ACCESSION 4699806 ' 

PID g4699806 

VERSION GI:4699806 
5 DBSOURCE pdb: chain 65, release Feb 23, 1999 

THEFM Themiococcus fumicolans 

ACCESSION P74918 

PID g3913528 

VERSION P74918 GI:3913528 
10 DBSOXJRCE swissprot: locus DPOL_THEFM, accession P749 18 

METTH Methanobacterium thermoautotrophicum 

ACCESSION 027276 

PID g3913522 

VERSION 027276 GI:3913522 
1 5 DBSOURCE swissprot: locus DPOL_METTH, accession 027276 

Metj a Methanococcus j annaschii 

ACCESSION Q58295 

PID §3915679 

VERSION Q58295 GI:3915679 
20 DBSOURCE swissprot: locus DPOL_METJA, accession Q5 8295 

POC Pyrodictium occultum 



wo 03/()6(H44 



PCT/US02/40423 



ACCESSION B56277 

PID gl 363344 

VERSION B56277 GI: 1363344 

DBSOURCE pin locus B56277 

Apel Aeropynun pemix 

ACCESSION BAAS 1109 

PID §5105797 

VERSION BAAS 1109.1 GI:5 105797 

DBSOURCE locus AP000063 accession AP000063.1 

ARCFU Archaeoglobus fulgidus 

ACCESSION 029753 

PID g3 1220 19 

VERSION 029753 GI:3 122019 

DBSOURCE swissprot: locus DPOL_ARCFU, accession 029753 
Desulfurococcus sp. Tok. 
ACCESSION 6435708 
PID g64357089 
VERSION GT:6435708 

DBSOURCE pdb. chain 65, release Jun 2, 1999 

Enzymes possessing 3-5' exonuclease activity for use in the present compositions and 
methods may be isolated from natural sources or produced through recombinant DNA 
techniques. Preferably, tiie enzyme comprising 3*-5' exonuclease activity is a DNA polymerase. 
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A DNA polymerase comprising 3 '-5' exonuclease activity (referred as exo^ is capable of 
proofreading the incorporated nucleotides produced by its own polymerization activity. Among 
other applications, exo^ DNA polymerases are particularly suited for cloning of PGR products, 
characterization of polynucleotide sequences. Useful exo"^ DNA polymerases include, but are 
5 not limited to, Pwo DNA polyinerase; Vent DNA polymerases; Deep Vent DNA polymerase; 
P'^Nm DNA polymerase; UlTma DNA polymerase; Tli DNA polymerase; Pfu DNA polymerase; 
JDF--3 DNA polymerase; Tgo DNA polymerase; KOD DNA polymerase; and PGB-D DNA 
polymerase. 

In preferred embodiments of the subject invention, an exo"*" DNA polymerase with 
10 reduced DNA polymerization activity is used as the second enzyme. 

Preparing Exo"*^ DNA Polvmerase With Reduced DNA Polym erization Activity 

The cloned wild-type Exo"*" DNA polymerase may be modified to generate forms 
exhibiting reduced polymerization activity by a number of methods. These include the methods 
described below and other methods known in the art. Any exo"^ DNA polymerase can be used to 
1 5 prepare for the exo^ DNA polymerase with reduced DNA polymerization activity in the 
invention. 

A. Genetic Modifications - Mutagenesis 

Tlie preferred method of preparing a DNA polymerase with reduced polymerization 
activity is by genetic modification (e.g., by modifying the DNA sequence of a wild-type DNA 

20 polymerase). Within the sequence of an exo"*^ DNA polymerase, the preferred sequence for 

genetic modification is the DNA sequence encoding the polymerization domain. Polymerization 
and exonuclease domains (i.e., their crystal structures) of many DNA polymerases are known in 
the art (for examples, see Rodriguez et al., 2000, J. Mol. Biol. 299:447-62; Zhao et al., 1999, 
Structure Fold Des. 7:1 189-99; Baker et al., 1998, Proc Natl Acad Sci USA. 95:3507-12; Kiefer 

25 et al., 1997, Stmcture 5:95-108; Kim et al., 1995, Nature, 376:612-6; Kong et al., 1993, J Biol 
Chem. 268:1965-75). 

General structure features of DNA polymerization domain is known in the art. For 
example, Blanco et al. (1991, Gene, 100:27-38) discloses that significant amino acid (aa) 
sequence similarity has been found in the C-terminal portion of 27 DNA-dependent DNA 
30 polymerases belonging to the two main superfamilies: (i) Escherichia coli DNA polymerase I 
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(Poir)-lilce prokaryotic DNA polymerases,. and (ii) DNA polymerase alpha-like prokaiyotic and 
eulcaryotic (viral and cellular) DNA polymerases. The six most conserved C-terminal regions, 
spanning approximately 340 ammo acids, are located in the same linear arrangement and contain 
highly conserved motifs and critical residues involved in the polymerization function. 

According to tlie tlH-ee-dimensional model of Pollk (Klenow fragment), tliese six 
conserved regions are located in the proposed polymerization domain, forming the metal and 
dNTP bindmg sites and the cleft for holding the DNA template. Site-directed mutagenesis 
studies support these structural predictions. 

The 3-5' exonuclease active site of E. coh DNA polymerase I is predicted to be 
conserved for both prokaryotic and eukaryotic DNA polymerases based on amino acid sequence 
homology (Bemad et al., 1989, Cell, 59:219-28). Three amino acid regions containing the 
critical residues in the E. coli DNA polymerase I involved in metal binding, single-stranded 
DNA binding, and catalysis of the exonuclease reaction are located in the amino-terminal half 
and in the same linear arrangement in several prokaryotic and eukaryotic DNA polymerases. 
Site-directed mutagenesis at the predicted exonuclease active site of the phi 29 DNA polymerase, 
a model enzyme for prokaryotic and eukaryotic alpha-like DNA polymerases, specifically 
inactivated the 3 '-5* exonuclease activity of the enzyme. These results reflect a high evolutionary 
conservation of this catalytic domain. 

With the great availability of sequences from DNA polymerases, it has become possible 
to delineate a few higjily conserved regions for various polymerase types (for review, see for 
example, Johnson, 1993, Annu Rev Biochem. 62:685-713). Delarae et al. reported an approach 
for unifying the structure of DNA polymerase (1990, Protein Eng., 3:461-7). The speculative 
hypothesis should provide a useful model to direct genetic modifications for preparing DNA 
polymerase with reduced polymerization activity. 

Preferably, the genetic modification for preparing exo^ DNA polymerase with reduced 
polymerization activity does not significantly reduces its 3 '-5' exonuclease activity (i.e., the 
proofreading ativity). 

Known DNA polymerase mutants that selectively reduce DNA polymerization activity 

can be found in the art, for example, in Blanco et al., 1995 Methods of Enzymology 262:283-294 

((Bacteriophage <|)29); Truniger et al., 1996, EMBO J. 15:3430-3441 (Bacteriophage <|)29); 

Abdus Sattar et aL,1996, Biochemistry 35:16621-9 (Bacteriophage T4); Tuske et al, 2000, J. 
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Biological Chemistry 275:23759-68 (Klenov/ fragment); Bolilke et al., 2000, Nucleic Acid 
Research 28:3910-3917 (Themococcus aggregans); Pisani et al., 1998, Biochemistry 37:15005- 
15012 (Sulfolobus solfataricus); Komori et al., 2000, Protein Eng 13:41-7 (Pyirococcus furiosus); 
Shen et al., 2001 J. Biological Chemistry 276:27376-83 (Pyrococcus horikoshi Family D). 

5 Site-directed mutagenesis of bacteriophage <}>29 DNA polymerase leads to the 

identification of mutations in the polymerase domain which reduce DNA polymerase activity, 
while having minimal effects on 3 '-5' exonuclease activity (Blanco, L. and Salas, M. 1995, 
Methods of Enzymology 262:283-294). In one embodiment of the invention, one or more 
conesponding amino acids in Pfii DNA polymerases are mutated (e.g., by substitutions: D405B, 
10 Y410F, T542P, D543G, K593T, Y595S). It is undCTStood that other amino acid side 

substitutions at these same sites would also selectively reduce DNA polymerase activity. 

The ^29 DNA polymerase mutagenesis studies targeted amino acid residues witiiin 
highly conserved Family B motifs (DXXSLYP [SEQ ID NO. 1], KXXXNSXYG [SBQ ID NO. 
2], TXXGR [SEQ ID NO. 3], YXDTDS [SEQ ID NO. 4], and KXY [SEQ ID NO. 5]). although 

1 5 other regions of the protein presumably can be mutagenized to selectively decrease DNA 

polymerase activity. One such region is the partitioning domain, characterized by the YXGG 
[SEQ ID NO. 6] motif (Truniger et al., 1996, EMBO J. 15:3430-3441). This region is located 
witliin an accessible loop connecting the 3'-5' exonuclease and polymerase domains. The 
partitioning domain plays a critical role in coordinating tiie balance between synthesis and 

20 degradation of the DNA chain. Mutations within this region disrupt the equilibrium between 
polymerization and proofreading, and produce phenotypes favoring either polymerization 
(reduced proofreading) or proofreading (reduced polymerization). 

Non-conservative (S,N) substitutions at Y387 (equivalent to Y385 in Pfii) in the partitioning 
domain of the archaeal Thermococciis aggregans DNA polymerase lead to a significant 

25 reduction in DNA polymerase activity and enhanced exonuclease activity, which results in 

improved enzyme fidelity (used alone in PCR) (Bohlke, K. et al (2000) NAR 28:3910-3917). In 
contrast, conservative substitutions at Y387 (F, W, H) lead to wild-type-like fidelity and enhanced 
PCR performance, which may be related to improved polymerization. A G389A mutation 
(equivalent to Pfu G387) in Tliermococcus aggregans DNA polymerase lead to reduced DNA 

30 polymerase activity (10% wt), increased exonuclease activity (236% wt), and loss of product 

synthesis in PCR (Bohlke, K. et al (2000) NAR 28:3910-3917). Analogous mutations have been 
investigated in bacteriophage <|)29 DNA polymerase (Truniger, V., et al (1996) EMBO J. 
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15:3430-3441) and in the archaeal Solfolobiis solfataricus (Sso) DNA polymerase (Pisani, F,M., 
DeFelice, M., and Rossi, M. (1998) Biochemistiy 37:15005-15012), where a G->A mutation 
either decreases (pol/exo = 0.6 for Sso) or increases (pol/exo = 91 for <|)29) DNA polymerase 
activity relative to exonuclease activity. 

5 In one embodiment of the invention, Pfu DNA polymerase was mutated witliin the 

partitioning domain at amino acids 384-389 (SYTGGF [SEQ ID NO. 7]) to obtain a Pfu DNA 
polymerase with reduced polymerization activity. It is understood that other amino acid side 
substitutions within the partitioning domain, e.g., at positions Y385, G387, G388, could also 
selectively reduce DNA polymerase activity while having minimal effects on exonuclease 
10 activity. 

In anotlier embodiment, two or mutations are combined (e.g., by introducing additional 
site-directed mutations into a mutant Pfu DNA polymerase) to effectively eliminate DNA 
polymerase activity, while retaining high levels of proofreading activity. 

U.S. Patent Nos. 5,691,142, 5,614,402 and 5,541,311 disclose methods of deriving 5'-3' 
1 5 nucleases from thermostable DNA polymerases for the detection of target polynucleotide 
molecules (hereby incorporated by reference). These methods can be applied to the subject 
invention for preparing DNA polymerase comprising 3 '-5' exonuclease activity with a reduced 
polymerization activity. Other techniques for genetic modification are well known in the art (see 
for example, Ausubel et al. Short Protocols in Molecular Biologv (1995) 3^^ Ed. John Wiley & 
20 Sons, Inc.). 

Modification to the primary stmcture of a wild type enzyme by deletion, addition, or 
alteration of the amino acids incorporated into the sequence during translation can be made 
without destroying the high temperature DNA polymerase activity of the protein. Such 
substitutions or other alterations result in proteins usefiil in the methods of the present invention. 
25 The availabiUty of DNA encoding these sequences provides the opportunity to modify the codon 
sequence to generate mutant enzymes having reduced polymerization activity. A few methods 
for altering DNA sequences are provided below, any other method known in the art may also be 
used. 

There are a number of site-directed mutagenesis methods known in the art which allow 
30 one to mutate a particular site or region in a straightforward manner, based on the sequences of 

• 25 ' . 
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the polymerization domain of a DNA polymerase. There are a number of kits available 
commercially for the performance of site-directed mutagenesis, including both conventional and 
PCR-based methods. Examples include the EXSITE™ PCR-Based Site-directed Mutagenesis 
Kit available from Stratagene (Catalog No. 200502) and the QUIKCHANGEtm Site-directed 
5 mutagenesis Kit from Stratagene (Catalog No. 2005 18), and the CHAMELEON® double- 
stranded Site-directed mutagenesis kit, also from Stratagene (Catalog No. 200509). 

Older methods of site-directed mutagenesis known in ttie art rehed upon sub-cloning of 
the sequence to be mutated into a vector, such as an M13 bacteriophage vector, that allows the 
isolation of single-stranded DNA template. In these methods one anneals a mutagenic primer 
1 0 (i.e., a primer capable of annealing to the site to be mutated but bearing one or mismatched 
nucleotides at the site to be mutated) to the single-stranded template and then polymerizes the 
complement of the template starting from the 3' end of the mutagenic primer. The resulting 
diqjlexes are then transformed into host bacteria and plaques are screened for the desired 
mutation. 

1 5 More recently, site-directed mutagaiesis has employed PGR methodologies, which have 

the advantage of not requiring a single-stranded template. In addition, methods have been 
developed that do not require sub-cloning. Several issues must be considered when PCR-based 
site-directed mutagenesis is performed. First, in these methods it is desirable to reduce the 
number of PCR cycles to prevent expansion of undesired mutations introduced by the 

20 polymerase. Second, a selection must be employed in order to reduce the number of non- 
mutated parental molecules persisting in the reactioiL Third, an extended-length PCR method is 
preferred in order to allow the use of a single PCR primer set. And fourth, because of the non- 
template-dependent terminal extension activity of some thermostable polymerases it is often 
necessary to incorporate an end-polishing step into the procedure prior to blunt-end ligation of 

25 the PCR-generated mutant product. 

The protocol described below accommodates these considerations through the following 
stq)s. First, the template concentration used is approximately 1000-fold higher than that used in 
conventional PCR reactions, allowing a reduction in the number of cycles froin 25-30 down to 5- 
10 without dramatically reducing product yield. Second, the restriction endonuclease Dpnl 
30 (recognition target sequence: 5-Gm6ATC-3, where the A residue is methylated) is used to select 
against parental DNA, since most common strains of E. coli Dam methylate their DNA at the 
sequence 5-GATC-3. Third, Taq Extender is used in the PGR mix in order to increase the 
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proportion of long (i.e., full plasmid lengtli) PGR products. Finally, Pfu DNA polymerase is 
used to polish the ends of the. PGR product prior to intramolecular ligation using T4 DNA ligase. 

A non-liniitihg example for the method is described in detail as follows: 

Plasmid template DNA (approximately 0,5 pmole) is added to a PGR cocktail containing: 
Ix mutagenesis buffer (20 mM Tris HCl, pH 7.5; 8 mM MgGk; 40 ixg/ml BSA); 12-20 pmole of 
each primer (one of skill in the art may design a mutagenic primer as necessary, giving 
consideration to those factors such as base composition, primer length and intended buffer salt 
concentrations that affect the annealing characteristics of oligonucleotide primers; one primer 
must contain the desued mutation, and one (the same or the other) must contain a 5' phosphate to 
facilitate later Ugation), 250 pM each dNTP, 2,5 U Taq DNA polymerase, and 2.5 U of Taq 
Extender (Available from Stratagene; See Nielsen et al. (1994) Strategies 7: 27, and U.S. Patent 
No. 5,556,772). Primers can be prepared using the triester method of Matteucci et al., 1981, J. 
Am, Chem. Soc. 103:3185-3191, incorporated herein by reference. Alternatively automated 
synthesis may be preferred, for example, on a Biosearch 8700 DNA Synthesizer using 
cyanoethyl phosphoramidite chemistry. 

The PGR cycling is performed as follows: 1 cycle of 4 min at 94°C, 2 min at 50°G and 2 
min at 72''G; followed by 5-10 cycles of 1 min at 94°G, 2 min at 54°C and 1 min at 72°G. The 
parental template DNA and the linear, PGR-generated DNA incorporating the mutagenic primer 
are treated with Dpnl (10 U) and Pfu DNA polymerase (2.51J). This results in the Dpnl 
digestion of the in vivo methylated parental template and hybrid DNA and the removal, by Pfu 
DNA polymerase, of the non-template-directed Taq DNA polymerase-extended base(s) on the 
linear PGR product. The reaction is incubated at 37°G for 30 min and then transferred to 72°C 
for an additional 30 min. Mutagenesis buffer (1 15 ul of Ix) containing 0.5 mM ATP is added to • 
the DpnI-digested, Pfu DNA polymerase-polished PGR products. The solution is mixed and 10 
ul are removed to a new microfiige tube and T4 DNA ligase (2-4 U) is added. The ligation is 
incubated for greater than 60 mm at 37°G. Finally, the treated solution is transformed into 
competent E. coli according to standard methods. 

Methods of random mutagenesis which will result in a panel of mutants bearing one or 
more randomly-situated mutations exist in the art . Such a panel of mutants may then be 
screened for those exhibiting reduced polymerization relative to the wild-type polymerase (e.g., 
by measuring the incorporation of lOnmoles of dNTPs into polymeric form in 30 minutes at the 
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optimal temperature for a given DNA polymerase). Aii example of a method for random 
mutagenesis is the so-called "error-prone PGR method". As the name implies, the method 
amplifies a given sequence mider conditions in v^hich the DNA polymerase does not support 
high fidelity incorporation. The conditions encouraging enror-prone incorporation for different 
DNA polymerases vary, however one skilled in the art may determine such conditions for a 
given enzyme. A key vaiiable for many DNA polymerases in the fidelity of amplification is, for 
example, the type and concentration of divalent metal ion in the buffer and the inherited fidelity 
of the PGR enzynie. The use of manganese ion and/or variation of the magnesium or manganese 
ion concentration may therefore be appUed to influence the error rate of the polymerase. 

In a preferred embodiment, the second enzyme with reduced polymerization activity is 
derived firom Pfii DNA polymerase. 

The DNA coding sequence of a wild-type Pfix DNA polymerase can be found in the art, 
for example, firom Genbank (accession No. U84155). A detailed description of the stracture and 
fimctiiin of Pfii DNA polymerase can be fovmd, among other places in U.S. Patent Nos. 
5,948,663; 5,866,395; 5,545,552; 5,556,772, all of which thereby incorporated by references. A 
not-limiting detailed procedure for preparing Pfix DNA polymerase with reduced polymerization 
activity is provided in Example 1 . 

A person of average skill in the art having the benefit of tliis disclosure will recognize 
that polymerases with reduced polymerization activity derived from other exo"^ DNA 
polymerases including Vent DNA polymerase, JDF-3 DNA polymerase, Tgo DNA polymerase 
and the like may be suitably used in the subject compositions. 

The first or the second enzyme of the subject composition may comprise DNA 
polymerases that have not yet been isolated. Assays for both DNA polymerization activity and 
3'-5' exonuclease activity can be found in the subject description and in DNA Replication 2nd 
Ed., Romberg and Baker, supra; Enzymes. Dixon and Webb, Supra, as well as other publications 
available to the person of ordinary skill in the art. 

In preferred embodiments of the invention, mutant Pfu DNA polymerase comprises one 
or more mutations at amino acid positions selected from the group consisting of: D405, Y410, 
T542, D543, K593, Y595, Y385, G387, and G388. 
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More preferably, the mutant Pfu DNA polymerase comprises one or more mutations 
selected from the group consisting of: D405E, Y410F, T542P, D543G, K593T, Y595S, Y385Q, 
Y385S, Y3S5N, Y385L, Y385H, G387S, G3S7P, and G388P. 

The invention encompasses compositions and methods in which a mutant of a related 
5 archaeal DNA polymerase is with reduced (e.g., deficient in) polymerase activity, while retaining 
proofreading activity. Such mutations may be within the partitioning domain or the polymerase 
domain of the DNA polymerases. Table 2 (A and B) and Figure 7 provides an unlimited 
example of such mutations in various DNA polynerases. A mutant DNA polymerase of the 
invention may comprise a single mutation as indicted in Table 2, or a combination of any two or 
10 more mutations. 



Table 2A Partitioning Domain Mutations in Various DNA Polymerases 



aizyme 


Domain 
(bp) 


Domain 
sequence 


Predicted Mutations 
for Reducing DNA 
Polymerase Activity* 


Preferred mutation 


Pfu 


384-389 


S YTGGF 


Y385, G387, G388 
(Y385N, Y385L, 
Y385H, Y385Q, 
Y385S; G387S, 
G387P; G388A, 
G388P) 


G387P 


Tgo 


383-388 


S YAGGY 
[SEQIDNO. 
10] 


Y384, G386, G387 
(Y384N, Y384L, 
Y384H, Y384Q, 
Y384S; G386S, 
G386P; G387A, 
G387P) 


G386P 


KOD 


383-388 


S YEGGY 
[SEQBDNO. 
11] 


Y384, G386. G387 
(Y384N, Y384L, 
Y384H, Y384Q, 
Y384S; G386S, 
G386P; G387A, 
G387P) 


G386P 


Vent 


386-391 


TYLGGY 
[SBQ ID NO. 
12] 


Y387, G389, G390 
(Y387N, Y387L, 
Y387H, Y387Q, 
Y387S; G389S, 
G389P; G390A, 


G389P 
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G390P) 




Deep Vent 


384-389 


S Y AGGY 


Y385, G387, G388 
(Y385N, Y385L, 
Y385H, Y385Q, 
Y385S; G387S, 
G387P; G388A, 
G388P) 


G387P 



Table 2B Polymerase Domain Mutations in Various DNA Polymerases 







Domain seauence 


Predicted Mutations for 




Reducing DNA Polymerase 








Activity# 





DXXSLYP 








405-411 


DFRALYP [SEQ 


D405 (D405E) 






ID NO. 13] 


D404 P404E) 




404-410 






DFRSLYP 






404-410 








DFRSLYP 


D407 (D404E) 


- ent 


407-413 








DFR5LYP 


D405 (D404E) 


Deep 


Af\C Alt 




Vent 




DFRSLYP 






YXDTDS 






Pfu 


539-544 


YIDTDG [SEQ 


T542, D543 (T542P; D543G) 






ID NO, 14] 


T^A'i n^49 rTS41P' T>542G\ 


Tgo 


538-543 




YADTDG [SEQ 


T541, D542 (T541P; D542G) 


KOD 


538-543 


ED NO. 15] 


Vent 


541-546 


YSDTDG [SEQ 


T544, D545 (T544P; D545G) 


Deep 


539-544 


ID NO. 16] 


T542, D543 (T542P; D543G) 


Vent 




YADTDG 








YIDTDG 






KXY 






Pfii 


593-595 


KRY [SEQ ID 


K593 (K593T) 






NO. 17] 


K592 (K592T) 


Tgo 


592-594 




KKY[SEQID 


K592(BC592T) 


KOD 


592-594 
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Vent 


595-597 


NO. 18] 


K595 (K595T) 


Deep 


593-595 


KKY 


K593 (K593T) 


Vent 












KRY 








KKY 





#altemative side chaiii substitutions at key positions are also expected to reduce polymerase 
activity 

B . Methods of Evaluating Mutants for Reduced Polymerization 

Random or site-directed mutants generated as known in the art or as described herein and 
5 expressed in bacteria may be screened for reduced polymerization by several different assays. 
Embodiments for the expression of mutant and wild type enzymes is described herein below ja 
section C. In one method, exo^ DNA polymerase proteins expressed in lytic lambda phage 
plaques generated by infection of host bacteria with expression vectors based on, for example:. 
Lambda Zapn^, are transferred to a membrane support. The immobilized proteins are then 
1 0 assayed for polymerase activity on the membrane by immersing the membranes in a buffer 
containing a DNA template and the imconventional nucleotides to be monitored for 
incorporation. 

Mutant polymerase libraries may be screened using a variation of the technique used by 
Sagner et al (Sagner, G., Ruger, R,, and Kessler, C. (1991) Gene 97:1 19-123), For this approach, 

15 lambda phage clones are plated at a density of 10-20 plaques per square centimeter. Proteins 
present in the plaques are transferred to filters and moistened with polymerase screening buffer 
(50mM Tris (pH 8.0), 7mM MgC12, 3mM p-ME). The filters are kept between layers of plastic 
wrap and glass while the host cell proteins are heat-inactivated by incubation at 65°C for 30 
minutes. The heat-treated filters are tlien transferred to firesh plastic wrap and approximately 351 

20 of polymerase assay cocktail are added for every square centimeter of filter. The assay cocktail 
consists of IX cloned Pfu (cPfu) magnesium fi-ee buffer (IX buffer is 20 mM Tris-HCl (pH 8.8), 
10 mM KCl, 10 mM (NH4)2S04, 100 ^g/ml bovine serum albumm (BSA), and 0.1% Triton X- 
100; Pfii Magnesium-firee buffer may be obtained fi^om Stratagene (Catalog No. 200534)), 125 
ng/ml activated calf thymus or salmon speim DNA, 1.29 |xCi/ml a-^^P ddNTP. The filters are 

25 placed between plastic wtj^) and a glass plate and then incubated at 65**C for one hour, and then 
at 70^C for one hour and fifteen minutes. Filters are then washed three times in 2X SSC for five 
minutes per wash before rinsing twice in 1 00% ethanol and vacuum drying. Filters are then 
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exposed to x-ray film (approximately 16 hours), and plaques that incorporate label are identified 
by aligning the filters with the original plate bearing the phage clones. Plaques identified in this 
way are re-plated at more dilute concentrations and assayed under simUar conditions to allow the 
. isolation of purified plaques . 

5 In assays such as the one described above, the signal generated by the label is a direct 

measure of tlie polymerization activity of the polymerase. A plaque comprising a mutant DNA 
polymerase witli reduced DNA polymerization activity compared to that of the wild-type enzyme 
can be selected. 

Incoq)oration of nucleotides may also be measured in extension reactions by adding, for 
10 example, 1 jil of s5)propriately diluted bacterial extract (i.e., heat-tireated and clarified extract of 
bacterial ceUs expressing a cloned polymerase or mutated cloned polymerase) to 10 ill of each 
nucleotide cocktail, followed by incubation at the optimal temperatiore for 30 minutes (e.g., 73°C 
for Pfu DNA polymerase), for example, as described in Hogrefe et al., 2001, Methods in 
Enzymology, 343:91-1 16. Extension reactions are quenched on ice, and tiien 5^ll aUquots are 
15 spotted immediately onto DE81 ion-exchange filters (2.3cm; Whatinan #3658323). 

Unincorporated label is removed by 6 washes witii 2xSCC (0.3M NaCU 30mM sodium citirate, 
pH 7,0), followed by a brief wash with 100% ethanol. Incorporated radioactivity is then 
measured by scintillation counting. Reactions that lack enzyme are also set up along with sample 
incubations to determine "total cpms" (omit filter wash steps) and "minimum cpms"(wash filters 
20 as above). Cpms bound is proportional to the amount of polymerase activity present per volume 
of bacterial extract. 

A Non-limiting method for determining polymerization activity of a DNA polymerase 
mutant relative to wild type (wt) is provided as follows. Relative percent radioactivity 
incorporated which indicates the relative polymerization activity of a DNA polymerase mutant 
25 can be determined as: 

f corrected cpms for mutant DNA polvmR ra.«;e^ x fne wt DNA polymerase) 
(corrected cpms for wt DNA polymerase; X (ng mutant DNA polymerase;. 

To more precisely quantify % activity, one should covert cpms incorporated into units of 
DNA polymerase activity. One unit of polymerase activity is defined as tiie amount of enzyme 
30 that catalyzes flie incorporation of lOnmoles of total dNTP into polymeric form (e.g., binds to 

32 



wo 03/060144 PCT/US02/40423 

DE-SI paper) in 30 minutes at optimal temperature. Units of DNA polymerase activity can be 
calculated using the following equation: 

(corrected sample cpms ) x (Snmoles dNTPs) x (1 unit) 

total cpms reaction (lOnmoles dNTPs incorporated) 

Polymerase specific activity (U/mg) can be extrapolated from the slope of the linear 
portion of units versus enzyme amount plots. Protein concentrations can be determined relative 
to a BSA standard (Pierce) in a colorimetric assay (e.g. Pierce's Coomassie Plus Protein Assay). 
Alternatively, when protein amounts are limiting (or for preparations of limited purifty), relative 
protein concentrations can be verified by SDS-PAGE analysis. Several aliquots of each DNA 
polymerase preparation, ranging from 1-20 ng of total protein, are subject to SDS-PAGE 
electrophoresis and the intensity of silver- and/or Sypro orange (Molecular Probes)-stained bands 
are compared to standards. Finally, % activity can be determined as: 

specific polymerase activity (TJ/mg) of mutant DNA polvmerase 
specific polymerase activity (U/mg) of wt DNA polymerase 

It is preferred tliat the polymerases with reduced polymerization activity of the present 
invention maintain their proofreading activities (i.e., 3 '-5' exonuclease activities). The mutant 
DNA polymerases v^th reduced DNA polymerization activities, therefore, are also assayed for 
3 '-5- exonuclease activities. 

Suitable exonuclease activity assays include one described in Hogrefe et al (supra, and as 
described in Example 3). Another assay employs double-stranded X DNA, which has been 
uniformly labeled with S-adenosyl methionine (NEN #NET-155) and Sss I methylase (NEB), 
and then restriction digested with Pal I (Kong et al., 1993, J. Biol. Chem. 268; 1965). Using 
double-stranded labeled DNA templates, one can determine specificity by measuring whether 
cpms decrease (3 '-5' exonuclease) with the addition of dNTPs (10-lOOfiM). A typical 
exonuclease reaction cocktail consists of Ix reaction buffer and 20|ig/ml ^H-labeled digested 
double-stranded X DNA (~10^ cpms/ml), prepared as described (Kong et al,, supra). 
Exonuclease activity can be measured in flie appropriate PGR buffer or in a universal assay 
buffer such as 70mM Tris HCl (pH 8.8), 2mMMgCl2, ai% Triton-X, and lOOKig/ml BSA. 

Percent exonuclease activity can be determined as: (corrected cpms for 
mutants)/(corrected cpms for wt DNA polymerase). To more precisely quantify % activity, 
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cpms released can be converted into units of exonuclease activity. One unit of exonuclease 
activity is defined as the amount of enzyme that catalyzes the acid-solubUization of lOnmoles of 
total dNMPs in 30 minutes at a defmed temperature. To determine units, background (average 
"minimum cpms" value) is first subtracted fi-om the average sample cpms. Nmoles dNMPs - 
5 released is calculated using the following equation: 

(corrected sample corns') x (ng DNA) x (Inmole dNMP) 
total cpms reaction (330ng dNMP) 

Units of exonuclease activity (in 30 minutes) can then be determined as: 

10 (nmoles dNMPs released per hr) x (1 unit) 

2 (lOimioles dNMPs released) 

Exonuclease specific activity (U/mg) can be extrapolated fi-om the slope of the linear portion of 
units versus enzyme amount plots. Finally, % activity can be determined as: 

15 specific exonuclease activity (U/mg ^ of mutant DNA polymerase 
specific exonuclease activity (U/mg) of wt DNA polymerase 

hi addition to the substrate described above, exonuclease activity can be also be 
quantified using ['H]-5. coli genomic DNA (NEN #NET561; S.S^Ci/^ig), a commercially- 
20 available substrate. A typical exonuclease reaction cocktail coiKists of 25ng/ml ^H-labeled E. 
coli genomic DNA and 975 ng/ml cold E. coU genomic DNA in Ix reaction buffer. Assays are 
performed as described above. 

Genes for desired mutant DNA polymerases generated by mutagenesis may be sequenced 
to identify the sites and number of inutations. For those mutants comprising more than one 
25 mutation, the effect of a given mutation may be evaluated by introduction of the identified 

mutation to the wUd-type gene by site-directed mutagenesis in isolation firom the other mutations 
borne by the particular mutant. Screening assays of the single mutant thus produced will then 
allow the determination of the effect of that mutation alone. 
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In one embodiment, the Pfix mutant is G387P, which reduces the error rate of wild type 
Pfii DNA polymerase by 3-fold in a Pfii blend when added at 5-25ng/50^1 reaction. The Pfu 
G387P mutant also reduces the error rate of Taq by approximately 5- to 10-fold in a blend when 
added at 6/60ng/50|al reaction, Pfu G387P exhibited 0.4% DNA polymerase activity and 57% 
exonuclease activity (i.e., relative to wild type Pfu) in a preliminary screen of partially purified 
(-50% purity) His-tagged proteins, eluted from nickel columns (Table 1). After column 
cliromatography (-95% purity), the His-tagged Pfii G387P mutant was found to be devoid of 
detectable DNA polymerase activity (<0.01% activity relative to wild type Pfii) (Table 3). 

C, Expression of Wild-type or Mutant enzymes According to the Invention 

Methods known in the art may be appUed to express and isolate the mutated forms of 
DNA polymerase (i.e., the second enzyme) according to the invention. The methods described 
here can be also applied for the expression of wild-type enzymes usefiil (e.g., the first enzyme) in 
the invention. Many bacterial expression vectors contain sequence elements or combinations of 
sequence elements allowing high level inducible expression of the protein encoded by a foreign 
sequence. For example, as mentioned above, bacteria expressing an integrated inducible form of 
the T7 RNA polymerase gene may be transformed with an expression vector bearing a mutated 
DNA polymerase gene linked to the T7 promoter. Induction of the T7 RNA polymerase by 
addition of an appropriate inducer, for example, isopropyl-p-D-thiogalactopyranoside (IPTG) for 
a lac-inducible promoter, induces the high level expression of the mutated gene from the T7 
promoter. 

Appropriate host strains of bacteria may be selected from those available in the art by one 
of skill in the art. As a non-limiting example, E. coU strain BL-21 is commonly used for 
expression of exogenous proteins since it is protease deficient relative to other strains of E. coli. 
BL-21 strains bearing an inducible T7 RNA polymerase gene include WJ56 and ER25 66 
(Gardner & Jack, 1999, supra). For situations in which codon usage for the particular 
polymerase gene differs from that normally seen in E, coU genes, there are strains of BL-21 that 
are modified to carry tRNA genes encoding tRNAs with rarer anticodons (for example, argU, 
ileY, leuW, and proL tRNA genes), allowing high efficiency expression of cloned protein genes, 
for example, cloned archaeal enzyme genes (several BL21-CODON PLUSTM cell strains 
carrying rare-codon tRNAs are available from Stratagene, for example). 
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There are many methods known to those of skill in the art that are suitable for the 
purification of a modified DNA polymerase of the invention. For example, the method of 
Lawyer et al. (1993, PGR Meth. & Ap p. 2: 275) is well suited for the isolation of DNA 
polymerases expressed in E. coU, as it was designed originally for the isolation of Taq 
polymerase. Alternatively, the method of Kong et aL (1993, J. Biol. Chem. 268: 1965, 
incorporated herein by reference) may be used, which employs a heat denaturation step to 
destroy host proteins, and two column purification steps (over DEAE-Sepharose and hepaiin- 
Sepharose columns) to isolate highly active and approximately 80% pure DNA polymerase. 
Further, DNA polymerase mutants may be isolated by an ammoniimi sulfate fractionation, 
followed by Q Sepharose and DNA cellulose columns, or by adsorption of contaminants on a 
HiTrap Q column, followed by gradient elution from a HiTrap heparin column. 

In one embodiment, the Pfii mutants are expressed and purified as described in U.S. 
Patent No. 5,489,523, thereby incorporated by reference in its entirety. 

D. Other Methods For Reducing Polymerization Activity 

hi order to prevent the side effects of having a high DNA polymerization activity in an 
amplification reaction, the polymerization activity of the composition of the invention may also 
be reduced by physical and/or chemical modification and/or inhibition. 

The polymerization activity of the subject composition may be reduced by chemical 
and/or physical means. Conditions which preferentially inhibit the polymerization activity of a 
DNA polymerase is known in the art (for reviews, see Johnson, 1993, supra; Wright, 1996, Acta 
Biochim Pol 43:1 15-24; EUon, 1982, Am J Med., 73:7-13). The level of polymerization activity 
need only be reduced to that level of activity which does not interfere with amplification 
reactions (e.g., does not significantly affect the exo"*^ activity of the composition or the efiSciency 
yield of the amplification reaction). 

Concentrations of Mg^^ greater than 5 mM inhibit the polymerization activity of the Pfii 
DNA polymerase. The effect of a given concentration of Mg^"*" for a given DNA polymerase 
may be determined by quantitation of the efficiency and specificity of polymCTization. 

The inhibitory effect of other ions, polyamines, denaturants, such as urea, formamide, 
dimethylsulfoxide, glycerol and non-ionic detergents (Triton X-100 and Tween-20), 
polynucleotide binding chemicals such as, actmomycin D, ethidium bromide and psoralens, may 
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be tested by their addition to the standard reaction buffers for polynucleotide aniplificaiton (e.g., 
PGR). Those compoimds having a preferential inhibitory effect on the polymerization activity 
but not significantly affecting the 3 '-5' exonuclease activity of a DNA polymerase are then used 
to create reaction conditions under which 3 '-5' nuclease activity is retained while polymerization 
activity is reduced. 

Physical means may be used to preferentially inhibit the polymerization activity of a 
polymerase. For example, the polymerization activity of thermostable polymerases is destroyed 
by exposure of the polymerase to extreme heat (typically 96°C to lOO^C) for extended periods of 
time (greater than or equal to 20 minutes). Wliile there are minor differences witli respect to the 
specific heat tolerance for each of the enzymes, these are readily determined. The polymerase 
mixture of the invention or the exo"*" DNA polymerase used as the second enzyme with reduced 
polymerization activity can be treated with heat for various periods of time and the effect of the 
heat treatment upon tlie polymerization and 3 '-5' nuclease activities is determined. Conditions 
reducing DNA polymerase activity but not significantly affecting the 3 '-5' exonuclease activity 
may be used to pretreat the polymerase mixture or the exo**" DNA polymerase used as second 
enzyme with reduced polymerization activity in the present invention: 

Enzvme Mixture 

The subject enzyme mixture composition comprises a first enzyme comprising DNA 
polymerization activity and a second enzyme comprising 3'-5' exonuclease activity with reduced 
DNA polym^ase activity. 

In one embodiment, the first enzyme is a DNA polymerase with 3 '-5' exonuclease 
activity. The fidelity of the first enzyme for DNA amplification is increased by the use of a 
second enzyme which also possesses 3'-5' Exonuclease activity. A preferred DNA polymerase 
with 3'-5' exonuclease activity as tiie first enzyme is a wild type Pfu DNA polymerase. 

In another embodiment, the first en2yme is a DNA polymerase without 3 '-5' exonuclease 
activity. The fidelity of an anq)lification reaction is provided by the second enzyme of the 
subject invention, \^ch possesses 3*-5* exonuclease activity. A preferred DNA polymerase 
without 3 '-5' exonuclease activity as the first enzyme is a Taq DNA polymerase. 
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In yet another embodiment, the first enzyme may is a reverse transcriptase with DNA 
polymerization activity. The fidehty of the reverse transcriptase in cDNA synthesis is increased 
by the use of a second enzyme which possesses 3 '-5* exonuclease activity. 

A. Selection of the first and the second enzyiiie pair 

In the subject method for DNA syntliesis, any enzyme comprising DNA polymerization 
activity may be mixed with a second enzyme comprising 3 '-5' exonuclease activity and reduced 
polymerization activity. 

When both first and second enzymes in the mixture comprise 3 '-5' exonuclease activity, 
it may be desirable to combine two enzymes with different proofireading activities. By "different 
proofireading activity", it means that two 3 '-5' exonucleases exhibits different proofi-eading 
preference for a nucleotide. For example, one 3 '-5' exonuclease may proofi^ead a G-T mispair 
more efficiently than an A-A mispair, another exonuclease having a different proofireading 
preference may proofi-ead an A-A mispair more efficiently than a G-T mispair. By using a 
second enzyme with a different proofi-eading preference from the first enzyme of the subject 
composition, one can enhance proofreading of the first enzyme by providing proofreading to 
mispairs which the first enzyme is not capable of recognizing and excising efficiently. 

Another factor to consider when selecting the first and the second enzymes of the subject 
invention is the compatibiUty of reaction conditions (e.g., pH, buffer composition, temperatiu^e 
requirement, etc.) required by each enzyme. 

La a preferred embodiment, the subject composition comprises a wild-type Pfii DNA 
polymerase as the first enzyme and a mutant Pfu DNA polymerase with reduced DNA 
polymerization activity as the second enzyme. Preferably, the mixture comprises a ratio of 2.5- 
5U Pfu DNA polymerase plxis an amount of a polymerase reduced mutant corresponding to 
<0.01U DNA polymerase activity and 0.007U to 0.04U of 3 '-5' exonuclease activity (or the 
amount of exonuclease activity containing within approximately 0.5 to lOU wild type Pfii). 
More preferably, tiie mixture comprises a ratio of 2.5-5U Pfu DNA polymerase plus an amount 
of a polymerase reduced mutant corresponding to <0,01U DNA polymerase activity and 0.02U 
of 3 '-5' exonuclease activity (or the amount of exonuclease activity contained within 2-311 wild 
type Pfu). In a preferred embodiment, tiie enzyme mixture composition comprises a wild-type 
Pfu DNA polymerase wifli 2.5U DNA polymerization activity and 0.02U 3'-5' exonuclease 
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activity as the first enzyme and a mutant DNA polymerase with reduced DNA polymerization 
activity (e.g., G387P) with 0.02U 3 '-5* exonuclease activity as the second enzyme. 

In another preferred embodiment, the subject composition comprises a wild-type Taq 
DNA polymerase as the first enzyme and a mutant Pfii DNA polymerase with reduced DNA 
5 polymerization activity as the second enzyme.* Preferably, the enzyme mixture comprises a ratio 
of 2.5U Taq DNA polymerase plus an amount of a polymerase deficient mutant corresponding to 
<OJU DNA polymerase activity and 0.01 to 0.2U of 3 '-5' exonuclease activity (or the amount of 
exonuclease activity contained within 1-20U wild type Pfii). More preferably, the enzyme 
mixture comprises a ratio of 2.5U Taq DNA polymerase plus an amount of a polymerase 

10 deficient mutant corresponding to <0.01U DNA polymerase activity and 0.08U of 3 '-5' 

exonuclease activity (or the amount of exonuclease activity contained within 10-12U wild type 
Pfu), In a preferred embodiment, the enzyme misture composition comprises a wild-type Taq 
DNA polymerase with 2.5U polymerization activity as the first enzyme and a mutant Pfii DNA 
polymerase with reduced polymerization activity (e.g., G387P) with 0.08U 3 '-5' exonuclease 

15 activity. 

Preferably the mutant Pfii DNA polymerase with reduced DNA polymerization activity 
comprises one or more mutations at amino acid positions selected fi'om the group consisting of: 
D405, Y410, T542, D543, K593, Y595, Y385, G387, and G388. 

More preferably, the mutant Pfu DNA polymerase comprises one or more mutations 
20 selected firom the group consisting of: D405E, Y410F, T542P, D543G, K593T, Y595S, Y385Q, 
Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

B. The Ratio Of Polymerization To Exonuclease Activity In The Enzyme Mixture 

In a variety of DNA synthesis and amplification procedures, the compositions of the 
present invention provide superior synthesis results (e.g., higher fidelity and efficiency), as 
25 compared with the synthesis results obtained with a single DNA polymerase or with a mixture 
comprising two wild type DNA polymerases. When using the subject composition, the ratio of 
total polymerization activity and total exonuclease activity in the enzyme mixture may be critical 
for optimal efificiency and fidelity of DNA synthesis. 

In the enzyme mixture of the subject invention, when DNA polymerases are used as tlie 
30 fijTSt and second enzymes, both enzymes may contribute to the polymerization and/or 3 '-5' 
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exonuclease activity. When an enzyme otlier than a conventional DNA polymerase is used as 
the first enzyme (e.g., a reverse transcriptase), both enzymes may contribute to DNA 
polymerization activity, but only the second enzyme contribute to tlie 3 '-5' exonuclease activity. 
Wlien an enzyme other than a DNA polymerase is used as the second enzyme (e.g., E. coli 
5 exonuclease I), both enzymes may contribute to the 3 '-5' exonuclease activity, but only tlie first 
enzyme contribute to the polymerization activity of the enzyme mixture. 

The ratio of the first and the second enzyme in the subject composition may be varied 
with respect to one another. The ratio of the DNA polymerization activity to 3-5' exonuclease 
. activity present in the subject composition employed in a given synthesis procedure may be 
10 readily optimized by perfomiing a series of simple experiments in which the ratio of the DNA 
polymerization activity to the exonuclease activity in the reaction mixture are systematically 
varied with respect to one another and the synthesis results compared. 

3 '-5' exonuclease activity has been shown to degrade imaimealed primers. The degraded 
primers would not be available in subsequent rounds of DNA amplification and would therefore 

15 effect the efficiency of the PGR reaction. In appUcations requiring very higji product yield, it 
may therefore be desirable to have a low concentration of the exonuclease activity relative to the 
DNA polymerization activity to decrease this effect and to increase the product yield. However, 
when fidelity is more important than yield, it may be desirable to have a higji concentration of 
the exonuclease activity relative to the DNA polymerization activity to increase the accuracy of 

20 the synthesis or amplification so long as the level of polymerization activity does not 
significantly inhibit the efficiency of the amplification. 

In a preferred embodiment, the ratio of the DNA polymerase activity and tiie exonuclease 
activity in the enzyme mixture is about (2,5-5U of DNA polymerization activity)/(0.02-5U of 3'- 
5' exonulcease activity), for example, about (2.5U of DNA polymerization activity)/(0,04-0.08U 
25 of 3 '-5' exonulcease activity). 

A pplications of The Subject Invention 

In one aspect, the invention provides a method for DNA synthesis using the compositions 
of the subject invention. The subject compositions may be used in various methods of 
polynucleotide synthesis in essentially the same manner as the DNA polymerase or other 
30 syntiietic enzyme present in the subject composition. Typically, synthesis of a polynucleotide 
requires a synthesis primer, a synthesis template, polynucleotide precxirsors for incorporation 
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into the newly synthesized polynucleotide, (e.g, dATP, dCTP, dGTP, dTTP), and the like. 
Detailed methods for carrying put polynucleotide synthesis are well known to the person of 
ordinary skill in tlie art and can be found, for example, in Molecular Cloning second edition , 
Sambrook et al.. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), 

5 A. Application In Amplification Reactions 

"Polymerase chain reaction'' or "PCR" refers to an in vitro metliod for amplifying a 
specific polynucleotide template sequence. Tlie technique of PCR is described in numeroiis 
publications, including, PCR: A Practical Approach, M. J. McPherson, et al., IRL Press (1991 ), 
PCR Protocols: A Guide to Methods and Applications, by Innis, et al.. Academic Press (1990), 
1 0 arid PCR Technology: Principals and Applications for DNA Amplification, H. A. Erlich, 

Stockton Press (1989). PCR is also described in many U.S. Patents, including U.S. Patent Nos. 
4,683,195; 4,683,202; 4,800,159; 4,965,188; 4,889,818; 5,075,216; 5,079,352; 5,104,792; 
5,023,171 ; 5,091,3 10; and 5,066,584, each of which is herein incoqiorated by reference. 

For ease of xmderstanding the advantages provided by the present invention, a summary 
15 of PCR is provided. The PCR reaction involves a repetitive series of temperature cycles and is 
typically performed in a volume of 50-100 |il. The reaction mix comprises dNTPs (each of the 
four deoxynucleotides dATP, dCTP, dGTP, and dTTP), primers, buffers, DNA polymerase, and 
polynucleotide template. PCR requires two primers that hybridize with the double-stranded 
target polynucleotide sequence to be amplified. In PCR, this double-stranded target sequence is 
20 denatured and one primer is annealed to each strand of the denatured target. The primers anneal 
to the target polynucleotide at sites removed firom one another and in orientations such that the 
extension product of one primer, when separated firom its complement, can hybridize to the other 
primer. Once a given primer hybridizes to the target sequence, the primer is extended by the 
action of a DNA polymerase. The extension product is tiien denatured from the target sequence, 
25 and the process is repeated* 

In successive cycles of this process, the extension products produced in earlier cycles 
serve as templates for DNA synthesis. Beginning in the second cycle, the product of 
amplification begins to acctunulate at a logarithmic rate. The amplification product is a discrete 
double-stranded DNA molecule comprising: a first strand which contains the sequence of the 
30 first primer, eventually followed by the sequence complementary to the second primer, and a 
second strand which is complementary to the first strand. 
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Due to the enormous amplification possible with the PGR process, small levels of DNA 
carryover firom samples with high DNA levels, positive control templates or firom previous 
amplifications can result in PGR product, even in the absence of purposefully added template 
DNA. If possible, all reaction mixes are set up in an area separate fi-om PGR product analysis 
5 and sample preparation. The use of dedicated or disposable vessels, solutions, and pipettes 
(preferably positive displacement pipettes) for RNA/DNA preparation, reaction mixing, and 
sample analysis will minimize cross contamination. See also Higuchi and Kwok, 1989, Nature, 
339:237-238 and Kwok, and Orrego, in: Innis et al. eds., 1990, PGR Protocols: A Guide to 
Methods and Applications, Academic Press, Inc., San Diego, Calif., which are incorporated 
1 0 herein by reference. 

L Thermostable Enzymes 

For PGR amplifications, the enzymes used in the invention are preferably thermostable. 
As used herein, *^ermostable" refers to an enzyme which is stable to heat, is heat resistant, and 
functions at high temperatures, e.g., 50 to 90°^. The themiostable enzyme according to the 

1 5 present invention must satisfy a single criterion to be effective for the amplification reaction, i.e., 
the enzyme must not become irreversibly denatured (inactivated) when subjected to the elevated 
temperatures for the time necessary to effect denaturation of double-stranded polynucleotides. 
By "irreversible denaturation" as used in diis connection, is meant a process bringing a 
permanent and complete loss of enzymatic activity. The heating conditions necessary for 

20 denaturation will depend, e.g., on the buffer salt concentration and the lengfli and nucleotide 
composition of the polynucleotides being denatured, but typically range from 85°C^ for shorter 
polynucleotides, to 105^C for a time depending mainly on the temperature and the 
polynucleotide length, typically from 0.25 minutes for shorter polynucleotides, to 4.0 minutes for 
longer pieces of DNA. Hi^er temperatures may be tolerated as the buffer salt concentration 

25 and/or GG comjposition of the polynucleotide is increased. Preferably, the enzyme will not 
become irreversibly denatured at 90 to lOO^G, An enzyme that does not become irreversibly 
denatured, according to the invention, retains at least 10%, or at least 25%, or at least 50% or 
more function or activity during the amplification reaction. 

2. PGR Reaction Mixture 

30 In addition to the subject enzyme mixture, one of average skill in the art may also employ 

other PGR parameters to increase the fidelity of synthesis/amplification reaction. It has been 
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reported PGR fidelity may be affected by factors such as changes in dNTP concentration, pH, 
units of enzyme used per reaction, and the ratio of Mg^"*^ to dNTPs present in the reaction 
(Mattila et al., 1991, supra). 

Mg^"^ concentration affects the amieahng of the oUgonucleotide primers to the template 
5 DNA by stabilizing the primer-template interaction, it also stabilizes the replication complex of 
polymerase with template-primer. It can therefore also increases non-specific annealing and 
produced undesirable PGR products (gives multiple bands in gel). When non-specific 
amplification occms, Mg^"*^ may need to be lowered or EDTA can be added to chelate Mg^"*" to 
increase the accuracy and specificity of the amplification. 

10 Other divalent cations such as Mn^"*", or Go^"*^ can also affect DNA polymerization. 

Suitable cations for each DNA polymerase are known in the art (e.g., in DNA Replication 2"^ 
edition, supra). Divalent cation is suppUed in the form of a salt such MgGl2, Mg(OAc)2, MgS04, 
MnGl2, Mn(OAc)2, or MnS04- Usable cation concentrations in a Tris-HGl buffer are for MnGk 
firom 0.5 to 7 mM, preferably, between 0.5 and 2 mM, and for MgCh firom 0.5 to 10 inM. 

1 5 Usable cation concentrations in a Bicine/KOAc buffer are firom 1 to 20 mM for Ma(pA6)2, 
preferably between 2 and 5 mM. 

Monovalent cation required by DNA polymerase may be supplied by the potassiirai, 
sodium, ammoniimi, or lithium salts of either chloride or acetate. For KGl, the concentration is 
between 1 and 200 niM, preferably the concentration is between 5 and 100 mM, although the 
20 optimum concentration may vary depending on the polymerase used in the reaction, 

Deoxyribonucleotide triphosphates (dNTPs) are added as solutions of the salts of dATP, 
dCTP, dGTP, dUTP, and dTTP, such as disodium or lithium salts. In the present methods, a 
final concentration in the range of 1 jjM to 2 mM each is suitable, and 100-600 |xM is preferable, 
although the optimal concentration of the nucleotides may vary in the reverse transcription 
25 reaction depending on the total dNTP and divaloit metal ion concentration, and on the buffer, 
salts, particular primers, and template. For longer products, i.e., greater than 1500 bp, 500 |xM 
each dNTP may be preferred when using a Tris-HCl buffer. 

dNTPs chelate divalent cations, therefore amount of divalent cations used may need to be 
changed according to the dNTP concentration in the reaction. Excessive amount of dNTPs (e.g., 
30 larger than 1.5 mM) can increase the eitor rate and possibly inhibits DNA polymerases. 
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Lowering the dNTP (e.g., to 10-50 |j,M) may tlierefore reduce error rate. PGR reaction for 
amplifying larger size template may need more dNTPs. 

One suitable buffering agent is Tris-HCl, preferably pH 8.3, although the pH may be in 
the range 8.0-8.8. The Tris-HCl concentration is from 5-250 mM, although 10-100 mM is most 
preferred. A preferred buffering agent is Bicine-KOH, preferably pH 8.3, although pH may be in 
the range 7.8-8.7. Bicine acts both as a pH buffer and as a metal buffer. 

PGR is a very powerful tool for DNA amplification therefore very little template DNA is 
needed. However, in some embodiments, to reduce the likelihood of error, a higher DNA 
concentration may be used, though too many templates may increase the amount of contaminants 
and reduce efficiency. 

Usually, up to 3 |iM of primers may be used, but high primer to template ratio can results 
in non-specific ampUfication and primer-dimer formation. Therefore it is usually necessary to 
check primer sequences to avoid primer-dimer formation. In a preferred embodiment, 0.1-0.5 
pM of primers are used. 

3. GyclLng Parameters 

Denaturation time may be increased if template GC content is hi^. Higjher annealing 
temperature may be needed for primers with high GC content or longer primers. Gradient PGR 
is a useful way of determining the annealing temperature. Extension time should be extended for 
larger PGR product amplifications. However, extension time may need to be reduced whenever 
possible to limit damage to enzyme. 

The number of cycle can be increased if the number of template DNA is very low, and 
decreased if high amount of template DNA is used. 

4, PGR Enhancing Factors And Additives 

PGR enhancing factors may also be used to improve efficiency of the amplification. As 
used herein, a *TGR enhancing factor" or a "Polymerase Enhancing Factor'' (PEF) refers to a 
complex or protein possessing polynucleotide polymerase enhancing activity (Hogrefe et al., 
1997, Strategies 10::93-96; and U.S. Patent No. 6,183,997, both of which are hereby 
incorporated by references). For Pfii DNA polymerase, PEF comprises either P45 in native form 
(as a complex of P50 and P45) or as a recombinant protein. In the native complex of Pfu P50 
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and P45, only P45 exhibits PGR enhancing activity. The P50 protein is similar in structure to a 
bacterial flavoprotein. The P45 protein is similar in structure to dCTP deaminase and dUTPase, 
but it functions only as a dUTPase converting dUTP to dUMP and pyrophosphate. PEF, 
according to the present invention, can also be selected from the group consisting of: an isolated 
5 or purified naturally occurring polymerase enhancing protem obtained from an archeabacteria 
source (e.g., Pyrococcus furiosus); a wholly or partially synthetic protein having the same amino 
acid sequence as Pfii P45, or analogs thereof possessing pol3anerase enhancing activity; 
pol>Tnerase-enhancing mixtures of one or more of said naturally occurring or v/hoUy or partially 
synthetic proteins; poljanerase-enliancing protein complexes of one or more of said naturally 

10 occurring or wholly or partially synthetic proteins; or polymerase-enhancing partially purified 
cell extracts containing one or more of said naturally occurring proteins (U.S. Patent No. 
6,1 83,997, supra). The PGR enhancing activity of PEF is defined by means well known in the 
art. The unit definition for PEF is based on the dUTPase activity of PEF (P45), which is 
determined by monitoring the production of pyrophosphate (PPi) from dUTP. For example, PEF 

15 is incubated with dUTP (lOmM dUTP in Ix cloned Pfu PGR buffer) during which time PEF 
hydrolyzes dUTP to dUMP and PPi. The amount of PPi formed is quantitated using a coupled 
enzymatic assay system that is commercially available from Sigma (#P7275), One unit of 
activity is fimctionally defined as 4.0 mnole of PPi formed per hour (at 85°G). 

Other PGR additives may also affect the accuracy and specificity of PGR reaction. 

20 EDTA less than 0.5 mM may be present in the ampUfication reaction mix. Detergents such as 
Tween-20™ and Nonidet™ P-40 are present in the enzyme dilution buffers.. A final 
concentration of non-ionic detergent approximately 0.1% or less is appropriate, however, 0,01- 
0.05% is preferred and will not interfere with polymerase activity. Similarly, glycerol is often 
present in enzyme preparations and is generally diluted to a concentration of 1-20% in the 

25 reaction mix. Glycerol (5-10%), formamide (1-5%) or DMSO (2-10%) can be added in PGR for 
template DNA with high GG content or long length (e.g., > Ikb). These additives change the Tm 
(melting temperature) of primer-template hybridization reaction and the thermostability of 
polymerase enzyme. BSA (up to 0.8 jig/iiil) can improve efiBciency of PGR reaction. Betaine 
(0.5-2M) is also usefiil for PGR over high GG content and long fiagments of DNA. 

30 Tetramethylammonium chloride (TMAG, >50mM), Tetraethylammoniima chloride (TEAG), and 
Trimethlamine N-oxide (TMANO) may also be used. Test PGR reactions may be performed to 
determine optimum concentration of each additive mentioned above. 
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Various specific PGR ampUfication applications are available in the art (for reviews, see 
for example, Erlich, 1999, Rev Jmmunogenet ., 1:127-34; Prediger 2001, Methods Mol. Biol. 
160:49-63; Jurecic et al., 2000, Curr. Qpin. Microbiol. 3:316-21; Triglia, 2000, Methods Mol. 
Biol. 130:79-83; MaClelland et al., 1994, PHR Methods Appl. 4:S66-81; Abramson aiid Myers, 
5 1993, Current Opinion in Biotechnology 4:41-47; each of which is incorporated herein by 
references). 

The subject invention can be used in PGR appUciations mclude, but are not limited to, i) 
hot-start PGR which reduces non-specific amplification; ii) touch-down PGR which starts at high 
annealing temperature, tlien decreases annealmg temperature in steps to reduce non-specific PGR 

10 product; iii) nested PGR which synthesizes more reliable product using an outer set of primers 
and an inner set of primers; iv) inverse PGR for amplification of regions flanking a known 
sequence. In this method, DNA is digested, the desired fragment is ckcularized by ligation, flien 
PGR using primer complementary to the known sequence extending outwards; v) AP-PGR 
(arbitrary primed)/RAPD (random amplified polymorphic DNA). These methods create 

15 geiibmic fingerprints from species with little-known target sequences by amplifying using 

arbitrary oligonucleotides; vi) RT-PGR which uses RNA-directed DNA polymerase (e.g., reverse 
transcriptase) to synthesize cDNAs which is then used for PGR. This method is extremely 
sensitive for detecting the expression of a specific sequence in a tissue or cells. It may also be 
use to quantify mRNA transcripts; vii) RAGE (rapid ampUfication of cDNA ends). This is used 

20 where information about DNA/protein sequence is limited. The method amplifies 3' or 5' ends of 
cDNAs generating fragments of cDNA with only one specific primer each (plus one adaptor 
primer). Overlapping RAGE products can then be combined to produce fiill length cDNA; viii) 
DD-PGR (differential display PGR) which is used to identify differentially expressed genes in 
different tissues. First step in DD-PGR involves RT-PGR, then amplification is performed using 

25 ^hort, intentionally nonspecific primers; ix) Multiplex-PGR in which two or more unique targets 
of DNA sequ«ices in tiie same specimen are amplified simultaneously. One DNA sequence can 
be use as control to verify the quaUfy of PGR; x) Q/G-PCR (Quantitative comparative) which 
•uses an internal control DNA sequence (but of different size) which compete with the target 
DNA (competitive PGR) for the same set of primers; xi) Recusive PGR which is used to 

30 synthesize genes. Ohgonucleotides used m this mettiod are complementary to stretches of a gene 
(>80 bases), alternately to the sense and to the antisense strands with ends ovwlapping (-20 
bases); xii) Asymmetric PGR; xiii) In Situ PGR; xiv) Site-directed PGR Mutagenesis. 
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It should be understood that this invention is not Umited to any particular amplification 
system. As other systems are developed, those systems inay benefit by practice of this invention, 
A recent survey of amplification systems was published in. 

B. Applications In Reverse Transcription 

5 The term "reverse transcriptase'* describes a class of polymerase characterized as RNA- 

dependent DNA polymerases. All loiown reverse transcriptases require a primer to synthesize a 
DNA transcript fi-om an RNA template. Historically, reverse transcriptase has been used 
primarily to transcribe mRNA into cDNA which can then be cloned into a vector for fiirther 
manipulation (e.g., PGR amplification by a DNA-dependent DNA polymerase). 

10 Avian myoblastosis virus (AMV) reverse transcriptase was the first widely used RNA- 

dependent DNA polymerase (Verma, 1977, Biochem.Biophys.Acta 473:1). The enzyme has 5-3' 
RNA-directed DNA polymerase activity, 5-3' DNA-directed DNA polymerase activity, and 
RNase H activity. RNase H is a processive 5* and 3* ribonuclease specific for the RNA strand of 
RNA-DNA hybrids (Perbal, 1984, A Practical Guide to Molecular Cloning. Wiley & Sons New 

15 York). Errors in transcription cannot be corrected by reverse transcriptase because known viral 
reverse transcriptases lack the 3-5* exonuclease activity necessary for proofireading (Saunders 
and Saunders, 1987, Microbial Genetics Applied t o Biotechn ology. Groom Helm, London). The 
use of the second enzyme in the subject composition provides proofireading for ttie reverse 
transcription reaction. A detailed study of the activity of AMV reverse transcriptase and its 

20 associated KNase H activity has been presented by Berger et al., 1983, Biochemistry 22:2365- 
2372. 

The reaction mixture for reverse transcription usually includes enzymes, aqueous buffers, 
salts, oligonucleotide primers, target polynucleotide, and nucleoside triphosphates. Depending 
upon the context, the mixture can be either a complete or incomplete reverse transcription 
25 reaction mixture. The reaction mixture can be modified according to the conditions required by 
the second enzyme of the subject composition. It is known that cDNAs can be obtained from 
mRNAs in vitro using a reverse transcriptase (RNA-dependent DNA polymerase). The fiiU 
length cDNA strands produced in turn may be used as a template for subsequent amplification 
reaction (e.g., PGR) and the like. 
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Reverse transcription in combination with PGR (RT-PCT) is utilized to detect the 
presence of one or many specific RNA molecules which may be present in a sample. The 
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method can be used to detect, for example, RNA from different organisms (such as viruses, 
bacteria, fungi, plants, and animals), or RNA indicative of an infection, a disease state, or 
predisposition to a disease. For example, mRNA specific to tumor cells can be detected. The 
method is also useful for detecting a class of microorganisms or a group of related disease 
5 conditions. 

Reverse transcription can generally be perfomied at any temperature within the 
functional temperature range of the reverse transcriptase. Preferably, the temperature of 
incubation is any temperature at which the reverse transcriptase is functional and the primer 
remains hybridized to the RNA molecule. For non-thermostable reverse transcriptases, preferred 
10 temperatures are those temperatures that are at or around the optimum temperature for the 

reverse transcriptase. For most non-thermostable reverse transcriptases this temperature will be 
between about IS^'C and 45''C 

U.S. Patent No. 5,994,079 discloses thermostable reverse transcriptases (herein 
incorporated by reference). Mn^"*" is preferred as the divalent cation and is typically included as a 
15 salt, for example, manganese chloride (MnCli), manganese acetate (Mja(OAc)2), or manganese 
sulfate (MnS04). If MnCla is included in a reaction containing 10 mM Tris buffer, for example, 
the MnCl2 is generally present at a concentration of 0.5-7.0 mM; 0.8-1,4 mM is preferred when 
200 ^iM of each dGTP, dATP, dUTP, and, dCTP are utilized; an 1.2 mM MnCh is most 
preferred. 

20 A thermostable reverse transcriptase may retain at least 5% of its maximum activity at 

any temperature above 50°C or has an optimal temp^ature of at least 50**C. The highest 
temperature at which a thermostable reverse transcriptase is functional can be quite high. For 
this reason, preferred temperature ranges for reverse transcription when a thermostable reverse 
transcriptase is used are most conveniently described in terms of flie calculated melting 

25 temperatm-e of a hybrid between the RNA molecule of interest and the primer. Such a melting 
temperature is referred to herein as the RNA/primer melting temperature (R/P Tm). Preferred 
ranges include a temperature from 20°C below tiie melting temperatwe of a hybrid between the 
RNA molecule of interest and the primer and 5°C above the melting temperature of a hybrid 
between the RNA molecule of interest and the primer. In general, the closer the traiperature is 

30 to the R/P Tm, the greater the degree of discrimination there will be between specific and non- 
specific hybrids of the RNA arid primer. If the temperature is close to the R/P Tm, however, 
decreased stability of specific hj^rids may cause priming to be less efficient. 
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Ryp Tm can be determined either by calculation or by empirical measurement. For 
calculating R/P Tm, any established formula for calculating stability of polynucleotide hybrids 
can be used. A preferred formula for calculating R/P Tm is Tm=8 1.5+1 6.6(log M)^0.41(% 
G"**C)-0.72(% formamide), which was derived from studies on the stability of perfectly-matched 
5 DNA:DNA hybrids. For RNArDNA hybrids, incorporating formamide concentration in the 
formula does not hold because the relationship between formamide concentration and the 
depression of Tm is not linear. At 80% formamide, RNA:DNA hybrids are more stable than 
DNAiDNA hybrids, increasing the Tm by about 10 to 30°C depending on the sequence (Hames 
& Higgins, Polynucleotide Hybridisation: A Practical Approach (IRL Press Limited, Oxford, 

10 England. 1985)). Carrying out the reaction in 80% fomiamide can therefore also be used to 
suppress formation of DNA:DNA duplexes, to preferentially select RNAiDNA hybrids, and to 
estimate the Tm for R/P. Because the empirically derived formulas for the estimation of 
RNAiDNA hybrid Tm may not be as accxirate for short DNA primers, the hybridization 
temperature is preferably determined by assessing hybrid stability in 0.1-0.4 M monovalent 

15 cation at temperatures ranging from 40 to 60®C R/P Tm can also be determined empirically 

(Lesnick and Freier, 1995, Biochemistry 34:10807-10815, McGraw et aL, 1990, Biotechniques 
8:674-678; and Rychlik et al., 1990, Polynucleotides Res. 18:6409-6412). 

The fideUty of viral reverse transcriptases, such as AMV-RT and MoMuLV-RT, may be 
compared to thermoactive reverse transcriptases by a straightforward assay procedure described 

20 in U.S. Patent No. 5,994,079 (supra). Plasmid BS^ (Stratagene) can be used for such an assay. 
The plasmid encodes an a-complementing p-galactosidase activity and can be linearized with 
Ndel. T3 RNA polymerase is used to prepare a cRNA transcript of the a-donor region. After 
treatment of the cRNA with KNase-free DNase and isolation of the cRNA, the cRN A is used as a 
template for a reverse transcription/ampUfrcation reaction. A reverse transcription primer 

25 coinplementary to the 3' end of the cDNA containing an Ndel sequence at its 5' terminus, and an 
upstream PGR primer comprising a PstI sequence at the 5' termini provide a 752 bp PC^ 
product The PGR product and the pBS*^ vector are then digested with Ndel and PstI followed by 
ligation of the PGR product into the vector and transformation into a suitable host. The presence 
of white colonies indicates that a mutation had occurred during the RT or PGR amplification. 

30 The assay provides means for assigning a relative value to the fidelity of the reverse transcriptase 
activity of various enzymes. Specific mutations can be determined by sequence analysis. 
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Following reverse transcription of RNA, the KNA can be removed from the RNA/cDNA 
hybrid by heat denaturation or by a number of other known means such as alkali, heat, or 
enzyme treatment. Enzyme treatment may consist of, for example, treating the KNA/cDNA 
hybrid with RNase H. RNase H is specific for RNA strands within an RNA/DNA double- 
5 stranded molecule. 

The subject composition is suitable for high fidelity transcribing and amplifying RNA 
from a number of sources. The RNA template may be contained within a polynucleotide 
preparation from an organism, for example, a viral or bacterial polynucleotide preparation. The 
preparation may contain cell debris and other components, purified total RNA, or purified 
10 mRNA. The RNA template may be a population of heterogeneous RNA molecules in a sample 
or a specific target RNA molecule. 

RNA suitable for use in the present methods may be contained in a biological sample 
suspected of containing a specific target RNA. The biological sample may be a heterogeneous 
sample in which RNA is a small portion of the sample, as in for example, a blood sample or a 
15 biopsied tissue sample. Thus, the subject composition is useful for clinical detection and 
diagnosis. The RNA target may be indicative of a specific disease or infectious agent. 

' RNA may be prepared by any number of methods known in the art; the choice may 
depend on the source of the sample and availability. Metliods for preparing RNA are described 
in Davis et al., 1986, Basic Methods in Molecular Biologv, Elsevier, NY, Chapter 11; Ausubel et 
20 aL, 1987, Current Protocols in Molecular Bioloev, Ch^ter 4, John Wiley and Sons, NY; 

Kawasaki and Wang, 1989, PGR Technologv, ed. Erlich, Stockton Press NY; Kawasaki, 1990, 
PGR Protocols: A Guide to Methods and Applications , frmis et al. eds. Academic Press, San 
Diego; all of which are incorporated herein by references. 

C. Detection Of Amplified Product 

25 Detection of amplified polynucleotide product can be accompUshed by any of a variety of 

well known techniques, hi a preferred embodiment, the amplified product is. separated on the 
basis of molecular weigjit by gel electrophoresis, and the separated products are then visualized 
by the use of polynucleotide specific stains which allow one to observe the discrete species of 
resolved anaplified product present in the gel. Although numerous polynucleotide specific stains 

30 exist and would be suitable to visualize the electrophoretically separated polynucleotides, 
ethidium bromide is preferred. 
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Alternative methods suitable to detect the amplified polynucleotide product include 
hybridization-based detection means that use a labeled polynucleotide probe capable of 
hybridizing to the amplified product. Exemplary of such detection means include the Southern 
blot analysis, ribonuclease protection analysis using in vitro labeled polyribonucleotide probes, 
and similar methods for detecting polynucleotides having specific nucleotide sequences. See, for 
example, Ausubel et aL, Cunent Protocols in Molecular Biologv. John Wiley & Sons, 1987. 

Amplified products (e.g., by PGR or RT-PCR) using the subject composition of the 
invention can be used for subsequent analysis such as sequencing or cloning, 

D. Application In Direct Cloning of PGR Amplified Product 

While it is understood that the ampUfied product using subject composition can be cloned 
by any method known in the art. In one embodiment, the invention provides a composition 
which allows direct cloning of PGR amplified product. 

The most common method for cloning PGR products involves incorporation of flanking 
restriction sites onto the ends of primer molecules. The PGR cycUng is carried out and the 
amplified DNA is then purified, restricted with an appropriate endonuclease(s) and ligated to a 
compatible vector preparation. 

A method for directly cloning PGR products eliminates tiie need for preparing primers 
having restriction recognition sequences and it woxild eliminate the need for a restriction step to 
prepare the PGR product for cloning. Additionally, such method would preferably allow cloning 
PGR products directly without an intervening purification step. 

U.S. Patent Nos. 5,827,657 and 5,487,993 (hereby incorporated by their entirety) 
discloses method for direct cloning of PGR products using a DNA polymerase which takes 
advantage of the single 3'-deoxy-adenosine monophosphate (dAMP) residues attached to the 3' 
termini of PGR gen^ated nucleic acids. Vectors are prepared with recognition sequences that 
afiford single 3 -terminal deoxy-thymidinc monophosphate (dTMP) residues upon reaction with a 
suitable restriction enzyme. Thus, PGR generated copies of genes can be directly cloned into the 
vectors without need for preparing primers having suitable restriction sites therein. 

Taq DNA polymerase exhibits terminal transferase activity that adds a single dATP to the 

3' ends of PGR products in the absence of template. This activity is the basis for the TA cloning 

method in which PGR products amplified with Taq are directed ligated into vectors containing 
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single 3'dT overhangs. Pfu DNA polymerase, on the other hand, lacks terminal transferase 
activity, and thus produces blunt-ended PGR products that are efficiently cloned into blxmt-ended 
vectors. 

In one embodiment, the subject invention comprises a Taq DNA polymerase as the first 
enzyme and a mutant Pfu DNA polymerase with reduced polymerization activity as the second 
enzyme. Taq DNA polymerase in the composition produces amplified DNA product with 3*- 
dAMP and allows direct cloning of the amplified product, while the mutant Pfu DNA 
polymerase provides fidelity for the amplification. 

Kits 

The invention herein also contemplates a kit format which comprises a package unit 
having one or more containers of the subject composition and in some embodiments -including 
containers of various reagents used for polynucleotide synthesis, including synthesis in PGR. 
Thexkit may also contain one or more of the following items: polynucleotide precursors, primers, 
buffers, instractions, and controls. Kits may include containers of reagents mixed together in 
suitable proportions for performing the methods in accordance with the invention. Reagent 
containers preferably contain reagents in unit quantities that obviate measuring steps when 
performing the subject methods. 

EXAMPLES 

The following examples are offered for the purpose of illustrating, not limiting, the 
subject invention. 

Example 1. Constructing Mutants Of Pfu DNA Polymerase With Reduced DNA 
Polymerase Activity 

We introduced mutations into Pfu DNA polymerase that were likely to reduce or 
eliminate DNA polymerase activity, while having minimal eflFects on proofireading activity. The 
mutations selected were identified firom previous mutagenesis studies carried out using related 
Family B DNA polymerases. We made the same amino acid side chain substitutions in the 
polymerization domain at the following residues in Pfu (D405E, Y410F, T542P, D543G, K593T, 
Y595S) (Table 1). 
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Mutations were also introduced witliin the partitioning domain at amino acids 384-389 
(SYTGGF) in Pfu DNA polymerase (Table 1). 

The DNA template used for mutagenesis contained tlie Pfii pol gene, cloned into 
pBluescript (pF72 clone described in US 5,489,523) and expressed witli an N-tenxiinal Hise tag 
5 for affinity purification. A modified QuikChange (Stratagene) protocol was used to insert the 
His6 tag at the 5 ' end of the Pfu pol gene, just after the initiator ATG. The insertion reaction was 
. carried out in two steps. In tiie first step, a standard QuikChange reaction was carried out in the 
presence of Tth ligase (lOU/RXN) using only the Hise forward primer. After 18 cycles, the 
reaction was ZJpnl-digested for one hour at 37^C and then purified with the StrataPrep® Plasmid 
10 Miniprep Kit (Stratagene). The purified material served as the template in the second 

QuikChange reaction, which employed only the Hise reverse primer. After 1 8 cycles, the second 
reaction was Dpnl-digested for one hour at 37°C, and then transformed. The Hise-Pfu pol 
construct was confimied by both PCR amplification and sequencing using the Big Dye 
sequencing kit. 

15 Point mutations were introduced into the Pfu pol gene using the QuikChange Site- 

Directed Mutagenesis Kit (Stratagene), Clones were sequenced to verify incorporation of the 
desired mutations. 



Table 1: Activity of partially-purified His-taggedi^M mutants (Nickel-resin elviates): 



Mutation 


Polymerase activity 


Exonuclease activity 


Rel. exo/pol 
vs. wt (1.0)* 




Cpms 

@50ng 

(500ng) 


% wild 
type 

@50ng* 

f 


Cpms 
(50 ng) 


% wild type® 
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Partitioning 












S3o4u 




71 








0*5 OAT/" 




1 no 






0 6 


Y385N 


1123 




1 <Q 


io 


lU.O 


Y385W 


10515 (24519) 


16 


iO 


4 


U.3 


Y385L 


2383 


4 


loU 


Z i 


D. / 


Y385H 


4276 


6 


91 


lU 


l.O 


Y385Q 


386(5431) 


0.6 


252 


29 


49.2 


Y385S 


1095 (4206) 


2 


578 . 


66 


39.8 


Y385F 


80685 (21580) 


100 


1008 


^ 1 f\i\ 
>100 


0.9 


T386E 


48296 


73 


263 


30 


0.4 


J386Y 


47318 


72 


1 1 1 O 

1112 


C\€\ 


1 Q 
l.O 


T386G 


46289 


70 


1011 


>100 


1.6 


G387S 


648 


1 




1 o 




G387P 


2do (oo) 










G388A 


2560 


4 


73 


.008 


2.2 


G388S 


74551 


100 


670 


76 


0.7 


G388P 


1222 


2 


202 


23 


12.5 
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"CO o cwz 

rjcjy Y 


43455 (29809) 


66 


3 / 


4 


.06 




72647 




100 


1U54 


>100 


1.1 


F389V 


30641 




46 


614 


70 


1.5 


F389S 


17998 




27 


1335 


>100 


5.6 


F389H 


19623 




30 


543 


62 


2.1 




Polymerase activity 


Exo/pol activity 


Polymerase 


Corns (3) 


% wild type 


Cpms 


Cpms 






5ng 


<§ 


J5ng* 


exonuclease 


Polymerase 


vs. wtO-O)*' 


DXXSLYP 














D405E 


69 (500ng) 


<0.2 


321 


0 


>396 


Y410F 


10181 


27 


698 


16189 


5.3 


YXDTDS 














T542P 


27 


.07 


1105 


0 


>1364 




10 


.03 


704 


687 


127 


T542P/D543G 


23 


.06 


505 


0 


>623 


KXY 














KS93T 


155 


.4 




668 


0 


>825 


Y595S 


6107 


16 


1072 


2684 
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100% for wtP/w equals: *66146 cpms; "38014 cpms; ®877cpms 
exo/pol for wt Pfu equals: *0.01326; *0.0081 
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Example 2. Affinity Purification Of His^Tagged Pfu DNA Polymerase Mutants 

Bacterial expression of Pfu mutants- Plasmid DNA was purified with the StrataPrep® 
Plasmid Miniprep Kit (Stratagene), and used to transform XL-10 Gold cells. Ampicillin resistant 
5 colonies were grown up in 1-5 liters of LB media containing Turbo Amp™ antibiotic (100|xg/^l) 
at 3TC with moderate aeration. The cells were collected by centrifugation and stored at '2(fC, 

Purification (His6 tag protocol/batch binding method): Cells pellets were resuspended 
in native bmding buffer (20mM phosphate (pH 7.8), 500mM NaCl), Egg white lysozyme 
(lOO^g/nil) was added and the cells were incubated for 15 minutes on ice. Cell suspensions were 

10 subjected to sonication three times with a Bronson Sonifier 250 at a duty cycle of 80% and an 
output level of 5 for 45 seconds. The suspensions were left on ice to cool between sonication 
events. The lysate was cleared by centrifugation at 26,S90g, The cleared lysates were added to 
5mls of ProBond Ni resin (Invitrogen), equilibrated in native binding buffer, and the slurry was 
incubated for two hours with gentle agitation at 4''C. The resin was settled by low speed 

15 centrifugation (800Xg). The resin was washed three times with 4ml of native binding buffer (pH 
7.8) by resuspending the resin, rocking the slurry for two minutes, and then separating the resin 
from the supernatant by gravity centrifugation. The resin was then washed in the same fashion 
with native wash buffer (20mM phosphate (pH 6.0), 500mM NaCl). Protein was eluted with two 
5-ml additions of 350mM Imidazole elution buffer (20mM phosphate, 500mM NaCl, 350mM 

20 Imidazole (pH 6.0)) by resuspending the resin, rocking the slurry for five minutes, and then 
separating the resin from the supernatant by gravity centrifugation. Eluted proteins were spin 
concentrated using Centricon 30 centrifugal filter devices (Amicon). Protein samples were 
. evaluated for size and purity by SDS-PAGE using Tris-Glycine 4-20% acrylamide gradient gels. 
Gels were stained with silver stain or Sypro Orange (Molecular Probes). 

25 Alternative expresssion/purification: Alternatively, Pfix mutants were subcloned into 

the pCAL-n-EK vector (Affinity^^ Protein Expression and Purification System) which contains 
an upstream, in-frame calmodulin binding peptide (CBP) tag for purifying fusion proteins witti 
calmodulm agarose* Plasmid DNA was pxirified with tiie StrataPrep® Plasmid Miniprep Kit 
(Stratagene), and used to transform BL210DE3) CodonPlus® cells. Ampicillin resistant colonies 

30 were grown up in 1-5 liters of LB media containing Txurbo Amp™ antibiotic (100jig/|il) at 30°C 
with moderate aeration. When cultures reached an absorbance at ODeoo of 0.6 to 1.0, the cells 
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were induced with ImM EPTG aiid incubated in the same niaiiner for 2 hours to overnight (16 
houi-s). Tlie cells were collected by centrifugation and stored at -20^C. 

Cells pellets were resuspended to aii approximate concentration of 0.25g/ml in buffers 
identical or similar to calcium binding buffer (50mM Tris-HCL (pH 8.0), 150 mM NaCI, ImM 
5 magnesium acetate and 2mM CaCl). Egg white lysozyme (100)xg/ml) was added arid the cells 
were incubated for 15 minutes on ice. Cell suspensions were subjected to sonication tliree times 
with a Bronson Sonifier 250 at a duty cycle of 80% and an output level of 5 for 45 seconds. The 
suspensions were left on ice to cool between sonication events. The lysate was cleared by 
centrifugation at 26,890g. 

10 The cleared lysates were added to 1ml of calmodulin agarose (CAM agarose), 

equilibrated in buffer, and the sluny was incubated with gentle agitation at 4°C- AftCT two hours 
the reactions were centrifiiged at 3000g for 5 minutes to collect the CAM agarose and 
recombinaiit protein. The lysate supematent was removed and the CAM agarose was washed at 
least once by resuspending the resin in 50ml of calcium binding buffer followed by collection of 

15 the CAM agarose by centrifugation as described above. The CAM agarose was transferred to a 
disposable 15ml column, packed, then washed with at least 200ml of calcimn binding buffer. 
Recombinant proteins were eluted from the column by using a buffer similar or identical to 
50mM Tris-HCl (pH 8.0), IM NaCl, 2mM EGTA. 

Protein samples were evaluated for size and purity by SDS-PAGE using Tris-Glycine 4- 
20 20% acrylamide gradient gels. Gels were stained with silver stain or Sypro Orange (Molecular 
Probes). 

Example 3. Assaying DNA Polymerase And 3'-5' Exonuclc^ase Activities Of Pfu DNA 
Polymerase Mutants 

Pfu mutant preparations were assayed for DNA polymerase and 3 '-5* exonuclease 
25 activities as follows. 

DNA polymerase, DNA polymerase activity was measured by monitoring incorporation 
of radiolabelled TTP into activated calf thymus DNA. A suitable DNA polymerase reaction 
cocktail contained: Ix PGR reaction buffer, 200nM each dATP, dCTP, and dGTP, IPSfiM TTP, 
5pM [3h]TTP (NEN #NET-221H, 20.5Ci/mmole; partially evaporated to remove EtOH), and 
30 250fxg/ml of activated calf thymus DNA (e.g., Phamiacia #27-4575-01). DNA polymerases (wt 
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Pfii or Pfu mutants) were diluted in Pfu storage buffer and of each enzyme dilution was 
added to 10|il aliquots of polymerase cocktail. Polymerization reactions were conducted in 
duplicate or triplicate for 30 minutes at 72®C. The extension reactions were quenched on ice, 
and then 5|a.l aUquots were spotted irmnediately onto DE81 ion-exchange filters (2.3cm; 

5 Whatman #3658323). Unincorporated [3H]TTP was removed by 6 washes with 2xSCC (0.3M 
NaCl, 30mM sodium citrate, pH 7,0), followed by a brief wash witii 100% ethanol. Incoiporated 
radioactivity was measured by scintillation counting. 

Reactions that lack enzyme were set up along with sample incubations to determine "total 
cpms" (omit filter wash steps) and "minimum cpms"(wash filters as above). Sample cpms were 
10 subtracted by minimum cpms to determine "corrected cpms" for each DNA polymerase. 

To determine percent (%) activity relative to wild type Pfii, -50-500ng of purified Pfu 
mutants were assayed in a nucleotide incorporation assay, alongside wild type Pfu diluted 
serially over the linear range of the assay (50-500pg; 0.003-0.03U). 

Exonuclease assays. Exonuclease reactions were performed (in triplicate) by adding 4fxl 
15 aliquots of diluted DNA polymerases (0,25-lOU wt Pfii; 5-200ng) to 46|J,1 of reaction cocktail. 
Reactions were incubated for 1 hour at 72''C. Reactions lacking DNA polymerase were also set 
up along with sample incubations to determine "total cpms" (no TCA precipitation) and 
"minimum cpms" (TCA precipitation, see below). 

Exonuclease reactions were stopped by transferring the tubes to ice. Sonicated salmon 
20 sperm DNA (150|xl; 2.5 mg/ml stock) and TCA (200|al; 10% stock) were added to all but the 

"total cpms" tubes. The precipitation reactions were incubated for >15 minutes on ice, and then 
spun in a microcentrifiige at 14,000rpm for 10 minutes. 200^1 of the supernatant was removed, 
being careful not to disturb the pellet, and transferred to scintillation fluid (Bio-Safe n™. 
Research Products Xntemational Corp.). The samples were thoroughly mixed by inversion and 
25 then counted in a scintillation coimter. 

To determine percent (%) exonuclease activity relative to wild type Pfu, equivalent 
amounts of Pfii and purified Pfii mutants (which fall in the linear range of the assaj^ '-5-200ng 
Pfii) are assayed in an exonuclease assay. 
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Results: Several Pfu mutants exhibited reductions in DNA polymerase activity compared 
to wild type Pfii, when tested as partially purified (--50% purity) preparations eluted fi-om nickel 
resins (Table 1). Pfu mutants showing <10% DNA polymerase activity and at least 10% 
exonuclease activity include the partitioning domain mutants: Y385QSNLH, G387SP, and 
5 G388P and the polymerase domain mutants: D405E, T542P, D543G, and K593T, The initial 
measurements of % DNA polymerase activity shown in Table 1 was considered as ^proximate 
estimates, due to the purity of the protein samples tested and uncertainties as to whether all 
protein amounts tested were in the linear range of the assay. 

Example 4. Purification Of Pfu DNA Polymerase Mutants By Conventional Column 
10 Chromatography 

The untagged or affinity-tagged fiisions of Pfu K593T and G387P mutants were purified 
as follows. Cells pellets (12-24 grams) were resuspended in 3 volumes of lysis buffer (buffer A: 
50mM Tris HCl (pH 8.2), ImM EDTA, and lOmM PME), Lysozyme (1 mg/g cells) and PMSF 
(ImM) were added and the cells were lysed for 1 hom at 4°C. The cell mixture was sonicated, 
15 and the debris removed by centrifugation at 1 5,000 ipm for 30 minutes (4°C). Tween 20 and 
Igepal CA-630 were added to final concentrations of 0.1% and the supernatant was heated at 
72**C for 10 minutes. Heat denatured E. coli proteins were then removed by centrifugation at 
15,000 rpm for 30 minutes (4**C). 

The supernatant was chromatographed on a Q-Sepharose™ Fast Flow column (—Sml 
20 column), equiUbrated in buffer B (buffer A plus 0.1% (v/v) Igepal CA-630, and 0.1% (v/v) 
Tween 20). Flow-through fractions were collected and then loaded directly onto a PI 1 
Phosphocellulose column (1.6 x 10cm), equilibrated in buffer C (same as buffer B, except pH 
7.5). The column was washed and then eluted with a 0-0.7M KCl gradient^uffer C. Fractions 
containing Pfu DNA polymerase mutants (951dD by SDS-PAGE) were dialyzed overnight against 
25 buffer D (50mM Tris HCl (pH 7.5), 5mM (JME, 5% (v/v) glycerol, 0.2% (v/v) Igepal CA-630, 
0.2% (v/v) Tween 20, and 0.5M NaCl) and then applied to a Hydroxyapatite column (1.0 x 1.3 
cm; --Iml), equilibrated in buffer D, The colunm was washed and Pfu DNA polymerase mutants 
were eluted with buffer D2 containing 400 mM KPO4, (pH 7.5), 5mM pME, 5% (v/v) glycerol, 
0.2% (v/v) Igepal CA-630, 0.2% (v/v) Tween 20, and 0,5 M NaCl, Purified proteins were spin 
30 concentrated using Cehtricon YM30 devices, and exchanged into Pfu final dialysis buffer 
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(50mM Tris-HCl (pH 8.2), O.lmM EDTA, ImM dithiothreitol (DTT), 50% (v/v) glycerol. 0.1% 
(v/v) Igepal CA-630, and 0.1% (v/v) Tween 20). 

Results: His-tagged and untagged Pfu G387P and K593T mutants were purified by ion 
exchange/hydroxy^petite (IE/HA) chromotography. The purified protein preps were analyzed 
5 by SDS-PAGE and determined to be of >95% purity. The IE/HA purified mutants were tested in 
a nucleotide incorporation assay to more precisely quantify percent remaining DNA polymerase 
activity. As shown in Table 3, the Pfu G387P mutant exhibits no significant DNA polymerase 
activity (<100 cpms above background) when up to 1.2|xg of protein was assayed. These results 
indicate that the Pfu G387P mutant exhibits <0.01% of the DNA polymerase activity exhibited 
10 by wild type Pfu DNA polymCTase. In comparison, the Pfu K593T mutant retains approximately 
1-2% of the DNA polymerase activity of wild type Pfu. 



Table 3. Residual Polymerase Activity m IE/HA Purified PJu Mutant Preps: 



Polyinerase 


Amount... 
^Assayed 
(ng) 


Corrected 
cpms . 


Relative (%) 

Polymerase. 
Activity 


Mean Relative Polymerase Activity 


HiS6-tagged mutant enzyme preps 






Wild type 


25 


16,661 


100 


100 


G387P 


240 


42 


0.026 


Cpms not significantly (<100cpms) 
above background; thCTefore, assume 
<100/16661 X 25/1200 = <0.01% 




600 


0 






1200 


16 


0.002 


K593T 


80 


1228 


2.3 


1.8 




200 


1774 


1.3 
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Untagged mutant enzyme prep 






Wild type 


2 


6134 


100 


100 


G387P 
Prqp J 


8.4 


60 


0.23 


Cpms not signincantly (<100cpms) 
above background; therefore, assume 
<100/6134 X 2/420 = <0.008% 


42 


0 




420 


8 


0.0006 



Example 5. Verifying The Presence Of Proofreading Activity In Pfu Mutants Under PCR 
Conditions 

A qualitative assay was used to verify that Hise-tagged Pfu mutants retained 3 '-5' 
5 exonuclease activity under PCR conditions. Li this assay, the 900bp Hal AT target is amplified 
with exo^ Pfii DNA polymerase (2.5U/50jal) using a forward primer containing a 3*dG, which 
produces a dG/dG mismatch upon annealing to the DNA template. The amplicon is amplified 
from human genomic DNA using the forward primer: 5'- 

GAG.GAaAGC.AGG.AAA,GGT.GGA.AG.-3^ [SEQ ID NO. 8] (100ng/50jal rxn) and the 
10 reverse primer: 5'-GAG.GTA.CAG,GGT,TGA.GGC.TACT.G - 3' [SEQ ID NO, 9] (lOOng/SOjal 
rxn). Amplification is carried out in the absence or presence of varying amounts of His6-'tagged 
Pfu mutants on a Perldn/Bfaner 9600 thermal cycler witti the following program: (1 cycle) 95°C 
for 2.5 minutes; (30 cycles) 95''C for 40 seconds, 61^C for 10 seconds, 72**C for 2.5 minutes; (1 
cycle) 72®C for 7 minutes. In the absence of proofreading activity, exo* Pfu produces low yields 
15 of product, presumably because the enzyme can not efficiently extend a dG/dG mismatch. In the 
presence of Pfu mutants vniSx proofreading activity, the 3MG should be excised from the primer, 
thereby allowing exo" Pfu to amplify the target in high yields. This PCR assay was used to 
verify that Pfu mutants tested in fideUty assaj^ retained sufficient proofreading activity under 
PCR conditions to excise mismatched PCR primers. Moreover, the assay allowed xis to 
20 determine the range of protein concentrations that could be added to PCR reactions without 
inhibition of amplification. 

Results: As shown in Figure 1, amplifications conducted with exo" Pfii alone produced 
low yields of product due to poor extension of the dG/dG mismatch. Product yields were 
significantly higher in the presence of the Hise-tagged Pfii G387P and K593T mutants, 
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presumably because these mutants excise the 3'dG from the primer, thereby allowmg exo' Pfii to 
efficiently amplify the target. Additional experiments showed that the polymerase deficient Pfii 
G387P and K593T mutants were unable to amplify tlxe target in the absence of exo" Pfii (or wild 
type Pfo). 

5 Example 6. PGR Amplification With Pfu Or Taq DNA Polymerase Blends Containing Pfu 
Mutants 

Pfu blends. PGR reactions were conducted under standard conditions in cloned Pfu PGR 
buffer (lOmM KGl, lOmM (NH4)2S04, 20mM Tris HGl (pH 8,8), 2mM Mg SO4, 0.1% Triton X- 
100, and 100p.g/ml BSA) with 2.5-5U l?faTi4rbo DNA polymerase (2.5U/^1 cloned Pfii DNA 

10 polymerase plus lU/jjl native or 2U/^1 cloned Pyrococcus fiiriosus dUTPase (PEF)) and varying 
concentrations of polymerase deficient Pfu mutants. For genomic targets 0.3-9kb in length, PGR 
reactions contained 2.5U VfaTurbo DNA polymerase, lOOng of human genomic DNA, 200|jM 
each dNTP, and lOOng of each primer. For genomic targets 11.9kb and 17kb in length, PGR 
reactions contained 5U YfuTurbo DNA polymerase, 250ng of human genomic DNA, SOOjiM 

15 each dNTP, and 200ng of each primer. 

Taq blends. PGR reactions were conducted under standard conditions in Herculase PGR 
buffer (50mM Tricine (pH 9.1), 8mM (NH4)2S04, 2.3inM MgGh, 0.1% Tween-20, and 75^g/ml 
BSA) with 2.5U cloned Taq DNA polymerase, lU of native or 2U cloned Pyrococcus furiosus 
dUTPase (PEF)), and varying concentrations of polymerase deficient Pfii mutants. 

20 Cycling Gonditions (Table 4): 



Target size (kb) 


Target gene 


Cycling Parameters 


0.3 


Aldolase B 


(1 cycle) 95°C 2 min 

(30 cycles) 95°C 40 sec, 58°C 30 sec, 72°C 1 min 
(1 cycle) ya^C 7 min 
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0.9 


HalAT 


(1 cycle) 95°C 2 min 

(30 cycles) 95°C 40 sec, 58°C 30 sec, 72°C 1 min 
(1 cycle) 72**C 7 min 


2.3 


Pfu pol 

(Sng plasmid 
DNA) 


(1 cycle) 95''C 2 min 

(30 cycles) 95*'C 40 sec, 58°C 30 sec, 72"»C 3 min 
(1 cycle) 72°C 7 min 


2.6 


HalAT 


(1 cycle) 95*0 2 min 

(30 cycles) 95°C 40 sec, 5S°C 30 sec, 72°C 3 min 
(1 cycle) 72°C 7 min 


4 


HalAT 


(1 cycle) 95°C 2 min 

(30 cycles)95°C 40 sec, 54''C 30 sec, 72''C 5 min 
(lcycle)72''C7min 


9.3 


HalAT 


(1 cycle)95»C2min 

(30 cycles) 95*C 40 sec, 58**C 30 sec, 72°C 18 min 
(1 cycle) 72»C 10 min 


11.9 


HalAT 


(lcycle)95»C2min 

(30 cycles) 95°C 40 sec, SS'C 30 sec, 72''C 24 min 
(1 cycle) 72"'C 10 min 
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1 7 


P glVlDlJUL 


(one cycle) 92°C 2 min 








(10 cycles) 92°C 10 sec, 63°C 30 sec, 68°C 30 min 






/OA QO^C^ 1 H 0*»r» A^^C^ "^Ck O/a/* /COO/^ OA /exilic* 

{Jaj cyciesj lu sec, oj sec, oo 3U tnui (pixis 






10 sec/cycle) 






(one cycle) 68°C 10 min 



Results (Pfu blend PGR performance): As shown in Figure 2, adding O.S^il of the Hisg- 
tagged Pfu G387P mutant to Pfu (in the presence of PEF/dUTPase), has minimal effects on PGR 
product yield. Additional experiments have shown that up to l.Sjxl of the Hise-tagged Pfu G387P 
5 mutant preparation can be added without significantly reducing PGR product yield. 

Results (Taq blend PGR performance): As shown in Figure 3, adding the Hise-tagged 
Pfu G387P mutant to Taq, in the presence of PEF/dUTPase, significantly increases PGR product 
yields when amplifications are performed in a reaction buffer that supports the activity of both 
Taq and Pfu DNA polymerases. One such buffer is the Herculase PGR buffer, which was 

10 developed specifically for Herculase Enhanced DNA polymerase (3 .33U/|li1 cloned Pfii, 1 .67U/|xl 
cloned Taq, 2U/p.l cloned Pyrococcus fiiriosus dUTPase), In the example shown in Figure 3, a 
4kb target could not be amplified in high yield usmg Taq alone in Taq, Pfu, or Herculase PGR 
buffer. In the presence of the His6-tagged Pfu G387P mutant (and dUTPase), the 4 kb target 
could be amplified in cloned Pfu buffer (moderate yield) but not Taq buffer, consistent with the 

15 buffer preferences of tiie Pfii G387P mutant. Other experiments have shown that the Pfu G3 87P 
mutant mhibits PGR reactions carried out with Taq in Taq PGR buffer, suggesting that the Pfu 
G387P mutant binds the 3' ends of PGR products without excising mismatches and dissociating 
(due to inactivity in Taq buffer), and blocks further product extension. As expected, highest 
product yields are obtained with Taq plus Pfu G387P blends in the presence of Herculase buffer, 

20 since both enzymes are highly active in this particular buffer. The Pfu G387P mutant is thought 
to enhance the yields of Taq PGR reactions (in buffers where Pfu is active) by excising mispairs 
that would otherwise stall Taq. 
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Example 7. Measuring The Fidelity Of DNA Polymerase Blends Containing His6-tagged 
Pfu DNA Polymerase Mutants 

The error rates of Pfii and Taq blends containing the Hise-tagged Pfu G387P and ICSPST 
mutants were tested in the lad PGR fidelity assay described in Cline, J., Braman, J.C., and 
Hogrefe,H.H, (96) NAR 24:3546-3551. Briefly, a 1.9kb fragment encoding the lacIOlacZa 
target gene was amplified from pPRIAZ plasmid DNA using 2.5U VfaTurbo in cloned Pfii PGR 
buffer or 2.5U Taq in Taq or Herculase PGR buffer. Varying amounts of the Pfu G387P and 
K593T mutants were added to certain reactions. For comparative purposes, the /ac/ target was 
also ampUfied with Pfx: (Thermococcus sp. KOD DNA polymerase; Invitrogen) and Tgo 
(Thermococcus gorgonarius DNA polymerase; Roche) using the manufacturers* recommended 
PGR buffer. The /acl-containing PGR products were then cloned into lambda GTIO arms, and 
the percentage of lad mutants (MF, mutation frequency) was determined in a color screening 
assay, as described (Lundberg, K.S-, Shoemaker, D.D,, Adams, M,W.W., Short, J.M., Sorge, 
J.A., and Mathur, EJ. (1991) Gene 180:1-8). Error rates are expressed as mutation frequency 
per bp per duplication (MF/bp/d), where bp is the number of detectable sites in flie lad gene 
sequence (349) and d is the nvimber of effective target doublmgs. For each enzyme, at least two 
independent PGR amplifications were performed- 

Error rate measurements have shown that Pfu and PfiiTwrfea DNA polymerases exhibit an 
average error rate which is -2-fold lower than that of Vent, Deep Vent, and Pfx (KOD) DNA 
polymerases, 3 to 6-fold lower than those of DNA polymerase mixtures, and 6- to 12-fold lower 
than that of Taq DNA polymerase. 

Results (Pfu blend): As shown Table 5, adding 0.5-3m1 of the DE/HA-purified His6- 
tagged Pfu G387P mutant reduced the error rate of VfaTurbo DNA polymerase by 3.2 to 3.5-fold 
(assay 1) and by 1.8 to 2,8-fold (assay 2) in two independent fidelity assays. As discussed iii 
Example 5, up to 1 .5^1 of die lE/HA-purified ffise-tagged Pfii G387G mutant can be added to 
PGR reactions without significantly reducing PGR product yield. 

In comparison, adding O.Sjxl of the Pfii K593T mutant reduced the error rate of F&iTurbo 
DNA polym^ase slightly (40%), while the addition of l.Sfxl and 3.0jil increased error rate by 
2.8- and 73-fold, respectively. At these amounts, approximately 0.5-lU of additional DNA 
polymerase activity is added to the PGR reaction (Pfii K593T mutant exhibits 1-2% polymerase 
activity). The K593T mutation significantly increases the misincorporation or mispair extension 
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rate of Pfii, and when added at high amounts (corresponding to >0.51J), the Pfu K593T mutant 
dramatically increases the error rate of wild type Pfo. 

Results (Taq blend): As shown Table 6, adding 0.5|4,1 and 3.0^1 of the Pfii G387G 
mutant reduced the error rate of Taq DNA polymerase by 5.1- and 8.3-fold, respectively. 
Therefore, the error rate of Taq in the presence of the Pfu G387G mutant, can equal the error rate 
of Pfu alone. 
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Figure 6. Fidelity of Pju Blends Containing lE/HA Purified Hisg-i^/w Mutants: 



PGR 
Enzyme 


Bis-Pfii 
Mutant 


Mutant 
Amount 


Error rate* 
(xlO-^ 


Mean Relative 
Accuracy 


Assay 1 


Assay 2 


Pfu 


None 


- 


5.55, 


3.60 


1.0 




G387P 


0.5 


1.60 


2.06 


2.6 






1.5 


1.65 


1.18 


3.2 






2.0 


Nd 


1.30 


2.8 






3.0 


1.75 


Nd 


3.2 




K593T 


0.5 


3.9 


Nd 


1.4 






1-5 


15.7 


Nd 


0.4 






3.0 


40.3 


Nd 


0.1 


Tgo 


None 




nd 


6.10 ; 


0,6 


Taq 


None 




34.7 


19.0 


.0.2 


*meanof d 


luplicate measurements 



Table 6. Fidelity of Tag Blends Containing IE/HA Purified P/u Mutants: 



PGR 


His-Pfu 






Error rate* . 


Relative 






Mut 


(xio;^ 




Enzyme 


Mutant 


ant 




. Accuracy 
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imt 












(^1) 
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Tag 


None 




34.7 


0.16 




G387P 


0.5 


6.8 


0.82 






3.0 


4.2 


1.32 




K593T 


0.5 


37.0 


0.15 


PJu 


None 


- 


5.6 


1.0 




G387P 


0.5 


1.60 


3.47 






3.0 


1.75 


3.17 




K593T 


0.5 


3.90 


1.42 



*mean of duplicate measurements 



Example 8. Determining Tlie TA Cloning Efficiencies Of PCR Products Amplified Witli 
Taq In Hie Presence Of Pfu Mutants 

To detennine the effects of polymerase deficient Pfu mutants on the temiinal transferase 
activity of Taq, we amplified a series of amplicons with Taq in the absence of the Pfu G387P 
mutant (in Taq PGR buffer) or in tiie presence of the Pfii G387P mutant (in Herculase PGR 
buflEer). Similar amplifications were performed using PfuTiw-fto and Herculase in their 
recommended PGR buffers. PGR product yields were quantified by analyzing the products on 
1% agarose gels, stained with SYBR gold. Hie same amount of each PGR product was added to 
Ipl of ttie pCR 2.1-TOPO vector (Invitrogen) in a final reaction volume of 6^1, according the 
manual for the TOPO TA Qoning Kit (#K4500-01). The reactions were incubated for 5 mmutes 
at room temperature, and then transferred to ice. The reactions were transfonned into One-Shot 
cells (Invitrogen), according to the manufecture's recommendations. Aliquots of each 
transformation were plated on ampUcillin/IPTG/X-gal plates, prepared as described in the 
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Invitrogen TOPO TA Cloning manual. The frequency of clones containing the desired insert (% 
cloning efficiency) was quantified as the number of (white coloiiies)/(total number of colonies 
plated). 

Results: As shown in Table 7, PGR products amplified with Taq in the presence of the Pfii 
5 G387P mutant are cloned into the TOPO TA cloning vector as efficiently as PGR products 

amplified with Taq alone. In contrast, PGR products amplified with IP&iTurbo DNA polymerase 
are cloned into the TOPO TA cloning vector much less efficiently, presimiably due to the lack of 
3' dAs. As discussed in Example 7, PGR products ampUfied with Taq blends containing the Pfii 
G387P mutant, should also exhibit fewer errors (5- to 8-fold less) compared to PGR products 

10 amplified with Taq alone. Therefore, Taq blends containing the Pfii G387P mutant should be 

usefiil to researchers using TA cloning metliods, but desiring high-fidelity amplication of inserts. 
The high TA cloning efficiencies obtained in the presence of the Pfii G387P mutant indicates 
that 3'dAs added by Taq during PGR are unexpectedly resistant to exonucleolytic degradation. 
Presumably, Pfii DNA polymerase is not very efficient at excising 3'dA residues from double- 

1 5 stranded PGR products in the presence of nucleotides. 
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PGR 
Product 
(bp) 


PGR enzym^lend 


Gloning 

efficiency 

(%) 


DNA polymerase 

ST y 


His6-i^ mutant 




mutant 


amount 
(Hi) 


900 


Taq 


none 




89 




G387P 


0.5 


80 








89 


Pfu 

J. J 






8 


300 


Trtn 

J, Ui£ 
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73 




\JJ<J / X 




78 


Pfu 






33 


Herculase 


None 




46 


2300 


Taq 


None 




83 




G387P 


0.5 


88 




G387P 


3.0 


92 


Pfi4 


None 




22 


Herculase 


None 




85 



Example 9, Expression And Activity Of Untagged Pfu Mutants 

The His6- tag was deleted from the His6-tagged Pfu G387P clone and the xmtagged 
mutant was expressed and purified as described in Example 4. Four Pfii G387P mutant samples 
were prepared and their protein concentrations determined by amino acid analysis. Exonuclease 
activity was measured using ^UrE. coli genomic DNA as substrate and the specific exonuclease 
activities of the mutant preparations are conq>ared to that of wild type Pfu in Table 8. The 
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Specific exonuclease activities of the Pfu G387P mutant preparations ranged jfrom 1300 to 2200 
U/mg, and appeared to be somewhat higher than that of wild type Pfu (350-95OU/mg). 



Table 8. Exonuclease specific activity of Pfii G387P Preparations 



DNA 

roiyinerase 


Lot/prep # 


Protein 


Exonuclease 


Exonuclease 


Polymerase 
activitv 










(# assays) 


m/fiV\ 


Pfo 


118.4447 


-0.05 


0.0174 


348 (1) 


2.5 


Pfu 


SCS 61 


2.29 


2,176 


950 (1) 


250 


PfiiG387P 


J 


4.17 


8.72 


2090 (5) 


0 


Pfu G387P 


SCS 1 


6.8 


8.86 


1320 (2) 


0 


Pfu G387P 


SCS 2 


3.0 


5.87 


1957 (1) 


0 


PfuGSSTP 


SCS 3 


2.6 


5,70 


2192 (1) 


0 



5 Example 10. Measuring The Fidelity Of DNA Polymerase Blends Containing The 
Untagged Pfu G387P Mutant 

The error rates of Pfii and Tag blends containing the untagged Pfu G387P mutant 
preparations were tested in the lad PGR fidelity assay as described in Example 7. As shown in 
Figure 4, the highest reductions in error rate (--S-fold) were observed when 6 to lOng ofPJic 

10 G387P prep J was added to 2.5U Pfii (SOjil reaction). Unexpectedly, fidelity appeared to 

decrease with increasing amounts (>10ng) of PJu P387G mutant The yield of lad ampUcon also 
decreased with increasing amount of PJu P387G mutant, suggesting that lower fidelity may m 
some way be correlated with reduced yield. Using prep J, optimal fidelity (lowest error rate) was 
achieved by adding 0,0125U to 0.0208U of exonuclease activity (prep J; 2090U/mg), which is 

15 the amount of 3'-5' exonuclease activity exhibited by - 1-3U of wild type Pfu. These 

assumptions are based upon Pfii exhibiting a specific activity of 348-950U exonuclease/mg and 
exo/pol ratios of 0.0174U/2.5U-0,02176U/2.5U, see Table 7. 
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Additional testing with G387P preparations SCS 1-3 showed that 6-24 ng or amounts of 
protein equivalent to 0,0125U, 0.0209U, or 0.03 14U of prep J consistently reduced the error rate 
of PfuTurbo DNA polymerase by -S-fold (Figure 5). There was minimal variation in error rate 
with lot of PfiiTurbo DNA polymerase employed (lots #59, 61, 63). 

As shown in Figure 6, adding 6ng to 60ng Pfu G387P prep J reduced the error rate of Tag 
DNA polymerase by 4.4- to 12.6-fold. Maximimi reduction in error rate was achieved by adding 
40ng of prep J, or the equivalent of 0.083 6U of exonuclease activity. In this assay, the accuracy 
of the Taq + 40ng Pfu G387P blend was 50% higher than that of PfuTurbo DNA polymerase. 

Example 11. Range Of Ratios Of Exonuclease And Polymerase Activities To Use In Blends 



Enzyme blend 


Polymerase proficient enzyme 


Polymerase deficient enzyme 


Polymerase 


Amount 
Polymerase 
(3'-5' Exo) 


Pfii 

Mutant 


Range of Amounts Tested that Produce 
Highest Fidelity and Yield 


Ng 


Polymerase (U) 


3 '-5' Exo (U) 


Pfa/ 

PfuTurbo 


2.5 

(0.02U exo) 


G387P 


5.7-24 
4 preps 


<0.01 


0.008-0.0314 


Taq 


2.5U 
(OUexo) 


G387P 


20-40 
pr^ J 


<0.01 


0.0418-0.0836 



OTHER EMBODIMENTS 

The foregoing exaxiq)les demonstrate experiments perfomied and contemplated by the 
present inventors in making and carrying out the invention. It is believed that these examples 
include a disclosure of techniques which serve to both apprise the art of the practice of the 
invention and to demonstrate its usefulness. It will be appreciated by those of skill in the art that 
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the techniques and embodiments disclosed herein are preferred embodiments only that in general 
niunerous equivalent methods and techniques may be employed to achieve the same result. 

All of tixe references identified hereinabove, are hereby expressly incorporated herein by 
reference to the extent that they describe, set forth, provide a basis for or enable compositions 
and/or methods which may be important to the practice of one or more embodiments of the 
present inventions. 
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1 . An enzyme mixture comprising a first enzyme and a second enzyme, wherein said first 
enzyme comprises a DNA polymerization activity, and said second enzyme is a mutant Pfu DNA 
polymerase comprising one or more mutations at amino acid positions selected from the group 

5 consisting of: D405, Y410, T542, D543, K593, Y595, Y385, G387, and G388. 

2. The enzyme mixture of claim 1, wherein said first enzyme is a DNA polymerase or a 
reverse transcriptase. 

3. The enzyme mixture of claim 2, wherein said DNA polymerase is selected from the 
group consisting of: Taq DNA polymerase, Tth DNA polymerase, UlTma DNA polymerase, Tli 

1 0 DNA polymerase, Pfii DNA polymerase, KOD DNA polymerase, JDF-3 DNA polymerase, 
PGB-D DNA polymerase and DP1/DP2 DNA polymerase. 

4. The enzyme mixture of claim 1, wherein said mutant Pfu DNA polymerase comprises 
one or more mutations selected from the group consisting of: D405E, Y410F, T542P, D543G, 
K593T, Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

15 5. The enzyme mixture of claim 4, wherein said mutant Pfu DNA polymerase comprises a 
mutation of G387P. 

6. The enzyme mixture of claim 5, further comprising a PGR enhancing factor and/or an 
additive. 

7. The enzyme mixture of claim 1 , further comprising a PGR enhancing factor and/or an 
20 additive. 

8. The enzyme mixture of claim 1, wherein said enzyme mixture has a ratio of 
polymerization activity/exonuclease activity of (2.5-5U)/(0,02-5U). 

9. The enzyme mixture of claim 8, wh^ein said enzyme mixture has a ratio of 
polymerization activity/exonuclease activity of (2.5U)/(0.04-0.08U). 

25 10. An enzyme mixture comprising a first enzyme and a second enzyme, wherein said second 
enzyme is a mutant DNA polymerase selected from the group consisting of: a mutant Tgo DNA 
polymerase comprising one or more mutations at amino acid positions selected from the group 
consisting of: D404, Y409, T541, D542, K592, Y594, Y384, G386, and G387; a mutant KOD 
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DNA polymerase comprising one or more mutations at amino acid positions selected from the 
group consisting of: D404, Y409, T541, D542, K592, Y594, Y384, G386, andG387; a mutant 
Vent DNA polymerase comprising one or more mutations at amino acid positions selected from 
the group consisting of: D407, Y412, T544, D545, K595, Y597, Y387, G389, and G390; a 
5 mutant Deep Vent DNA polymerase comprising one or more mutations at amino acid positions 
selected from the group consisting of: D405, Y410, T542, D543, K593, Y595, Y385, G387, and 
G388- 

1 1 . The enzyme mixture of claun 1 0, wherein said first enzyme is a DNA polymereise or a 
reverse transcriptase. 

10 12. The enzyme mixture of claim 1 1, wherein said DNA polymerase is selected from the 

group consisting of: Taq DNA polymerase, Tth DNA polymerase, UlTma DNA polymerase, TU 
DNA polymerase, Pfii DNA polymerase, KOD DNA polymerase, JDF-3 DNA polymerase, 
PGB-D DNA polymerase and DP1/DP2 DNA polymerase. 

13. An enzyme mixture comprising a first, a second, and a third enzyme, wherein said second 
15 enzyme and said third enzjmies are difference enzymes selected from the group of mutant Pfii 
DNA polymerase, mutant Tgo DNA polymerase, mutant KOD DNA polymerase, mutant Vent 
DNA polymerase, and mutant Deep Vent DNA polymerase, wherein said mutant Pfii DNA 
polymCTase comprises one or more mutations at amino acid positions selected from the group 
consisting of: D405, Y410, T542, D543, K593, Y595, Y385, G387, and G388; said mutant Tgo 
20 DNA polymerase comprises one or more mutations at amino acid positions selected from the 

group consisting of: D404, Y409, T541, D542, K592, Y594, Y384, G386, and G387; said mutant 
KOD DNA polymerase comprises one or more mutations at amino acid positions selected from 
the group consisting of: D404, Y409, T541, D542, K592, Y594, Y384, G386, and G387; said 
mutant Yeat DNA polymerase comprises one or more mutations at amino acid positions selected 
25 from the group consisting of: D407, Y412, T544, D545, K595, Y597, Y387, G389, and G390; 
said mutant Deqp Vent DNA polymerase comprising one or more mutations at amino acid 
positions selected from the group consisting of: D405, Y410, T542, D543, K593, Y595, Y385, 
G387,andG388. 

14- The enzyme mixture of claim 13, wherein said first enzyme is a wild-type DNA 
30 polymerase. 
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1 5 . The enzyme mixture of claim 14, wherein said first enzyme is a DNA polymerase having 
no 3 '-5' exonuxlease activity or a DNA polymerase having a 3'-5' exonuxlease activity. 

16. The enzyme mixture of claim 1 5, wherein said DNA polymerase having no 3 '-5' 
exonuxlease activity is Taq DNA polymerase. 

17. Tlie enzyme mixture of claim 1 5, wherein said DNA polymerase having a 3 *-5 ' 
exonuxlease activity is selected from the group of Pfu DNA polymerase, Tgo DNA polymerase, 
KOD DNA polymerase. Vent DNA polymerase, and Deep Vent DNA polymerase. 

1 8. The enzyme mixture of claim 1 3, wherein said second enzyme is a mutant JDF-3 or a 
mutant KOD DNA polymerase and second thud enzyme is a muatnt Pfu DNA polymerase. 

19. The enzyme mixture of claim 1 8, wherein said muatnt JDF-3, KOD or Pfu DNA 
polyemrase comprises a mutation of G387P. 

20. An enzyme mixture comprising a first enzyme and a second enzyme, wherein said first 
enzyme is a wild type Pfii DNA polymerase, said second enzyme is a mutant Pfu DNA 
polymerase comprising a 3 '-5' exonuclease activity and a reduced DNA polymerization activity. 

21 . The enzyme mixture of claim 20, wherein said mutant Pfu DNA polymerase comprises a 
mutation in its partitioning domain or ttie polymerase domain. 

22. An enzyme mixture comprising a first enzyme and a second enzyme, wherein said first 
enzyme is an Archaeal DNA polymerase, said second enzyme is a mutant Archaeal DNA 
polymerase comprising a 3 '-5' exonuclease activity and a reduced DNA polymerization activity. 

23 . The enzyme mixture of claim 22, wherein said mutant Archaeal DNA polymerase is 
derived firom a DNA polymerase selected firom the group consisting of: UlTma DNA 
polymerase, TTi DNA polymerase, Pfii DNA polymerase, KOD DNA polymerase, JDF-3 DNA 
polymerase, PGB-D DNA polymerase and DP1/DP2 DNA polymerase. 

24. The enzyme noixture of claim 1, wherein said first enzyme is a wild type Taq DNA 
polymerase or a wild type Pfii DNA polymerase. 

25 . An enzyme mixture comprising three or more enzymes, whwein at least one enzyme in 
said enzyme mixture is a mutant enzyme which comprises a 3 '-5' exonuclease activity and a 
reduced DNA polymerization activity. 
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26. The enzyme mixture of claim 25, wherein said mutant enzyme is a mutant Pfii DNA 
polymerase, 

27. The enzyme mixture of claim 26, wherein said mutant Pfu DNA polymerase comprises 
one or more mutations selected from the group consisting of: D405E, Y410F, T542P, D543G, 

5 K593T, Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

28. The enzyme mixture of claim 27, wherein said mutant Pfii DNA polymerase comprises 
the mutation of G387P, 

29. The enzyme mixture of claim 26 or 27, wherein at least two enzymes in said mixture are 
mixed as an enzyme blend before being added to said enzyme mixture. 

10 30. The enzyme mixture of claim 29, wherein said enzyme blend comprises a wild-type Pfii 
DNA polymerase and a wild-type Taq DNA polymerase. 

3 1 . The enzyme mixture of claim 30, wherein said enzyme blend further comprises a PGR 
enhancing factor. 

32. A mutant Pfii DNA polymerase with reduced DNA polymerization activity, wherein said 
IS mutant Pfu DNA polymerase comprises one or more mutations at amino acid positions selected 

from the group consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

33. The mutant DNA polymerase of claim 32, wherein said mutant Pfu DNA polymerase 
comprises one or more mutations selected from the group consisting of: T542P, D543G, K593T, 
Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

20 34. A composition comprising a mutant Pfu DNA polymerase, wherein said mutant DNA 
polymerase comprises one or more mutations at aiioino acid positions selected from the group 
consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

35. The composition of claim 34, wherein said mutant Pfu DNA polymerase comprises one 
or more mutations selected from the group consisting of: T542P, D543G, K593T, Y595S, 

25 Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

36. A mutant Pfii DNA polymerase produced by introducing a mutation in to a 
polynucleotide encoding a wild type Pfii DNA polymerase to produce a mutant Pfii DNA 
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polymerase which comprises one or more mutations at amino acid positions selected from the 
group consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

37. A mutant Pfu DNA polymerase comprising a reduced DNA polymerization activity, 
wherein said mutant Pfu DNA polymerase is produced by the steps: 

5 (a) providing a polynucleotide encoding a wild-type Pfu DNA polymerase; 

(b) introducing one or more nucleotide mutations into said polynucleotide to produce a mutant 
polynucleotide encodmg said mutant Pfu DNA polymerase; and 

(c) expressing said mutant polynucleotide to produce said mutant Pfu DNA polymerase, wherein 
said mutant Pfii DNA polymerase comprises one or more mutations at amino acid positions 

10 selected from the group consisting of: T542, D543, K593, Y595, Y385, G387, and G388, 

38. The mutant DNA polymerase of claim 37, wherem said mutant Pfu DNA polymerase 
comprises one or more mutations selected from the group consisting of: T542P, D543G, K593T, 
Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

39. A composition comprising a mutant Pfu DNA polymerase produced by expressing a 
15 polynucleotide encoding a Pfu DNA polymerase with a reduced DNA polymerization activity, 

wherein said mutant Pfu DNA polymerase comprises one or more mutations at amino acid 
positions selected from the group consistmg of: T542, D543, K593, Y595, Y385, G387, and 
G388. 

40. A composition comprising a mutant Pfu DNA polymerase comprising a reduced DNA 
20 polymerization activity, wherein said mutant Pfu DNA polymerase is produced by the steps: 

(a) introducing a mutation into a polynucleotide encoding a wild-type Pfii DNA polymerase to 
produce a mutant polynucleotide encoding said mutant Pfu DNA polym^ase comprising one or 
more mutations at amino acid positions selected from flie group coiisisting of: T542, D543, 
K593, Y595, Y385, G387, and G388; 

25 (b) expressing said mutant polynucleotide to produce said composition comprising said mutant 
Pfti DNA polymerase. 
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41 . The composition of claim 39 or 40, wherein said mutant Pfu DNA polymerase comprises 
one or more mutations selected from the group consisting of: T542P, D543G, K593T, Y595S, 
Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

42. A kit comprising a first enzyme, a second enzyme, and packaging material flierefor, 

5 wherein said first enzyme comprises a DNA polymerization activity, said second enzyme is a 
mutant Pfu DNA polymerase comprising one or more mutations at amino acid positions selected 
firom the group consisting of: D405, Y410, T542, D543, K593, Y595, Y385, G387, and G388. 

43- The kit of claim 42, wherein said first enzyme is a DNA polymerase or a reverse 
transcriptase. 

10 44. The kit of claim 43, wherein said DNA polymerase is selected firom the group consisting 
of: Taq DNA polymerase, Tth DNA polymerase, UlTma DNA polymerase, Tli DNA 
polymerase, Pfu DNA polymerase, KOD DNA polymerase, JDF-3 DNA polymerase, PGB-D 
DNA polymerase and DP1/DP2 DNA polymerase. 

45. The composition of claim 42, wherein said first enzyme is a wild type Taq DNA 
15 polymerase or a wild type Pfii DNA polymerase, 

46. A kit comprising a first enzyme and a second enzyme, and packaging material therefor, 
wherein said first enzyme is a wild type Pfu DNA polymerase, said second enzyme is a mutant 
Pfu DNA polymerase comprising a 3'-5' exonuclease activity and a reduced DNA 
polymmzation activity. 

20 47. The kit of claim 42, or 46, further comprising one or more components selected from the 
group consisting of: a deoxynucleotide, a reaction buffer, a PGR enhancing factor and/or an 
additive, a control DNA template and a control primer. 

48. The kit of claim 42, wherein said mutant Pfu DNA polymerase comprises one or more 
mutations selected from the group consisting of: D405E, Y410F, T542P, D543G, K593T, 

25 Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

49. The kit of claim 36, wherein said enzyme mixture has a ratio of polymerization 
activity/exonuclease activity of (2.5-5U)/(0.02-5U), 
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50. The kit of claim 49, wherein said enzyme mixture has a ratio of polymerization 
activity/exonuclease activity of (2,5U)/(0.04-0.08U), 

51. The kit of claim 47, wherein said mutant Pfu DNA polymerase comprises a mutation of 
G387P. 

5 52. A kit comprising a mutant DNA polymerase which comprises a reduced DNA 
polymerization activity and packaging material therefor, wherein said mutant Pfu DNA 
polymerase comprises one or more mutations at amiuo acid positions selected from the group 
consisting of: T542, D543, K593, Y595, Y385, G387, and G3S8. 

53, The kit of claim 52, wherein said mutant Pfu DNA polymerase comprises one or more 
10 mutations selected from the group consisting of: T542P, D543G, K593T, Y595S, Y385Q, 

Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

54. An isolated polynucleotide encoding a mutant Pfu PNA polymerase, wherein said mutant 
Pfu DNA polymerase comprises one or more mutations at amino acid positions selected from the 
group consisting of: T542, D543, K593, Y595, Y385, G387, and G388. 

15 55. The isolated polynucleotide of claim 54, wherein said mutant Pfu DNA polymerase 

comprises one or more mutations selected from the group consisting of: Y410F, T542P, D543G, 
K593T, Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

56. A pair of polynucleotides comprising a first and a second polynucleotides, wherein said 
second polynucleotide of said pair comprises a polynucleotide sequence encoding a mutant Pfu 

20 DNA polymerase which comprises one or more mutations at amino acid positions selected from 
the group consisting of: D40.5, Y410, T542, D543, K593, Y595, Y385, G387, and G388. 

57. The pair of polynucleotides of claim 56, wherein said second polynucleotide of said pair 
comprises a polynucleotide sequence encoding a mutant Pfu DNA polymerase which comprises 
one or more mutations selected from the group consisting of: D405E, Y410F, T542P, D543G, 

25 K593T, Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

58. A method for DNA synthesis comprising: 

(a) providing an enzyme mixture, said enzyme mixture comprising a first enzyme 
comprising a DNA polymerization activity, and a second enzyme which is a mutant Pfu DNA 
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polymerase comprising one or more mutations at amino acid positions selected from the group 
consisting of: D405, Y410, T542, D543, K593, Y595, Y385, G387, and G388; and 

(b) contacting said enzyme mixture with a nucleic acid template, wherein said 
enzyme mixture pennits DNA synthesis. 

5 59. The method of claim 58, wherein said nucleic acid template is a DNA molecule. 

60. The method of claim 58, wherein said first enzyme is a DNA polymerase or a reverse 
transcriptase. 

61 . The method of claim 60, wherein said DNA polymerase is selected from the group 
consisting of: Taq DNA polymerase, Tth DNA polymerase, UlTma DNA polymerase, Tli DNA 

1 0 polymerase, Pfti DNA polymerase, KOD DNA polymCTase, JDF-3 DNA polymerase, PGB-D 
DNA polymerase and DP1/DP2 DNA polymerase. 

62. A method for TA cloning of DNA synthesis product comprising: 

(a) providing an exizyme mixture, said enzyme mixture comprising a Taq DNA 
polymerase as a first enzyme, and a mutant Pfu DNA polymerase as a second enzyme which 

15 comprises a 3'-5' exonuclease activity and a reduced DNA polymerization activity; 

(b) contacting said enzyme mixture with a nucleic acid template, wherein said 
enzyme mixture pennits DNA synthesis to generate a synthesized DNA product; and 

(c) inserting said synthesized DNA product into a TA cloning vector. 

63. The method of claim 62, wherein said mutant Pfu DNA polymerase comprises one or 
20 more mutations at amino acid positions selected from the group consisting of: D405, Y410, 

T542, D543, K593, Y595, Y385, G387, and G388. 

64. The method of claim 63, wherein said mutant Pfii DNA polymerase comprises one or 
more mutations selected from the group consisting of: D405E, Y410F, T542P, D543G, K593T, 
Y595S, Y385Q, Y385S, Y385N, Y385L, Y385H, G387S, G387P, and G388P. 

25 65. The method of claim 58, or 62, wherem said reaction mixture fiirther conqjrises a PGR 
enhancing factor and/or an additive. 
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Figure 1. PGR Proofreading Activity Assay 
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Figure 2. PCRPerfonnance of PJu plus Pfu G387P mutant blends 
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Figure 3 . PGR Performance of Tag plus Pfu G387P mutant blends 
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Figure 7 Polynucleotide and polypeptide sequences of various DNA polymerase 
mutants ?iccording to some, embodiments of the invention 

Partitioning Doiaain Mutants 

>Pfu wild type 

mildvdyiteegkpvirlf kkengkf kiehdrtf rpyiyallrddskieevkkitgerhgkivrivdvekvekkflg 

kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilafdietlyhegeefgk 

gpiimisyadeneakvitwknidlpyvewsseremikrf Iriirekdpdiivtyngdsfdfpylakraeklgiklt 

igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 

akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewf llrkayernevapnkpseeeyqrrlresy 

tggfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 

kiktkinketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekf gf kvly 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgmvigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlryqktrqvgltswl 
nikks // [SEQ ID NO. 19] 

>Pfu Y385N 

mildvdyiteegkpvirlf kkengkf kiehdrtf rpyiyallrddskieevkkitgerhgkivrivdvekvekkflg 

kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilaf dietlyhegeefgk 

gpiimisyadeneakvitwknidlpyvevvsseremikrflriirekdpdiivtyngdsfdfpylakraek 

igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 

akysmedakatyelgkef Ipmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresn 

tggfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapcprghkfckdipgfipsllghlleerq 

kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvly 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgiuvigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlryqktrqvgltswl 
nikks [SEQ ID NO. 20] 

>Pfu Y385L 

mildvdyiteegkpvirlf kkengkf kiehdrtf rpyiyallrddskieevkJcitgerhgkivrivdvekvekkflg 
kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilaf dietlyhegeefgk 
gpiimisyadeneakvitwknidlpyvevvssereinikrf Iriirekdpdiivtyngdsfdfpylakraeklgiklt 
igrdgsepkmqrigdratavevkgrihfdlyhvitrtinlptytleavyeaifgjqpkekvyadeiakawesgenlerv 
akysmedakatyelgkef Ipmeiqlsrlvgqplwdvsrsstgnlvewfllrkayemevapnkpseeeyqrrlresl 
tggfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 
kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgf kvly 
idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 
seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 
ikpgmvigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlryqktrqvgltswl 
nikks tSEQ ID NO. 21] 

>Pfu Y385H 

mildvdyiteegkpvirlf kkengkf kiehdrtf rpyiyallrddskieevkkitgerhgkivrivdvekvekkflg 
kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilaf dietlyhegeefgk 
gpiiiiiisyadeneakvitwknidlpyvevvsseremikrflriirekdpdiivtyngdsfdfpylakraeklgiklt 
igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 
akysmedakatyelgkef Ipmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresh 
tggfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 
kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfJcvly 
idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 
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seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgmvigyxvlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlryqktrovaltswl 
nikks [SEQ ID NO. 22] ^ j-sivyxi.£.wj. 



>Pfu Y385Q 



raildvdyateegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgkivrivdvekvekkfla 
kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilafdietlyhegeefak 
gpiimasyadeneakvitwknidlpyvevvssereniikrflriirekdpdiivtyngdsfdfpylakraeklaiklt 
^?^^^^^P!f5^i?<^tayevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 
akysitiedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqirlresa 
tggfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleera 
kxktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvlv 
idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 
seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakqvk 

>Pfu y385S 

mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgkivrivdvekvekkfla 
kpitvwklylehpqdvptirekvrehpavvdifeydipfakrylidkglipmegeeelkilafdietlyhegeefak 
gpxiiuisyadeneakvitwknidlpyvewsseremikrflriirekdpdiivtyngdsfdfpylakraeklaiklt 
igrdgsepkmqrigdmtayevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 
akysmedakatyelgke£lpmei.qlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlress 
i^Swv^! H '^®rt7^i'*^''f^yP^^^^^*^''^P*^^^'^^®^'='^y^^^P*I^9^''f«'^dipgfipsll 

kaktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesytawgrkyielvwkele 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

s^aketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

>Pfu G387S 

mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgkivrivdvBkvekkflo 

kpxtvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilafdietlyhegeefak 

gpixnuLsyadeneakvitwknidlpyvevvsseremikrflriirekdpdilvtyngdsfdfpylakraeklqiklt 

igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 

akysMedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresv 

tsgfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 

kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvlv 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgmvigyxvlrgdgpisnrailaeeydpkkhkydaeyyienqwlpavlrilegfgyrkedlryqktravaltswl 
nxkks [SEQ ID NO. 25] m y-^i-awx 



mxldvdyxteegkpvxrlfkkengkfkiehdrtfrpyiyallrddskieevkkxtgerhgkivrivdvekvekkflg 
kpxtvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilafdietlyhegeefgk 
gpxxmxsyadeneakvitwknidlpyvewsseremikrflriirekdpdiivtyngdsfdfpylakraeklgiklt 
xgrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 
akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresv 

kxktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvly 
xdtdglyatxpggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvxtrgleivrrdw 
seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 
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ikpgmvigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlrvQktrc^ 
nikks [SEQ ID NO. 26] ^ 

>Pfu G388A 

mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgkivrivdvekvekkflg 

kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipraegeeelkilafdietlyhegeefgk 

gpiimisyadeneakvitwknidlpyvewsseremikrflriirekdpdiivtyngdsfdfpylakraeklgiklt 

igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 

akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresy 

tgafvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 

kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvly 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgiavigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlrvQktravaltswl 
nikks [SEQ ID NO. 27] ^ ^ 

>Pfu G388P 

mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgkivrivdvekvekkflg 

kpitvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilafdietlyhegeefg 

/gpiimisyadeneakvitwknidlpyvevTTssereiaikrflriirekdpdiivtyngdsfdfpylakraeklgiklt 

igrdgsepkmqrigdititavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 

akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresy 

tgpfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 

kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgf kvly 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

sexaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgmvigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlrvaktrcrv^al 
nikks [SEQ ID NO. 28] jr-i >d y wj. 

>Tgo wild type 

mildtdyitedgkpvirifkkengefkidydmfepyiyallkddsaiedvkkitaerhgttvrwraekvkkkflg 

rpievwklyfthpqdvpairdkikehpawdiyeydipfakrylidkglipmegdeelkralafdietlyhegeefae 

gpilmisyadeegarvitwknidlpyvdvvstekemikrflkvvkekdpdvlityngdnfdfaylkkrseklgvkfi 

Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 

arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresya 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgfkvlya 

dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaargiki 

rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktravalaawlk 
pkt [SEQ ID NO. 29] ^ ^ ^ 

>Tgo Y384N 

mildtdyitedgkpvirifkkengefkidydrnfepyiyailkddsaiedvkkitaerhgttvrwraekvkkkflg 

rpievwklyfthpqdvpairdkikehpavvdiyeydipfakrylidkglipmegdeelkmlafdietlyhegeefae 

gpiliidsyadeegarvitwknidlpyvdvvstekeiid.krflkvvkekdpdvlityngdnfdfaylk^ 

Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 

arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresna 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgfkvlya 

dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaargikl 
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rpcftvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 
pkt [SEQ ID NO. 30] 

>Tgo Y384L 

mildtdyitedgkpvirif kkengef kidydrnf epyiyallkddsaiedvkkitaerhgttvrvvraekvkkkf Ig 

rpievwklyfthpqdvpairdkikehpavvdiyeydipfakrylidkglipmegdeelkmlafdietlyhegeefae 

gpilmisyadeegarvitwknidlpyvdv^stekemikrf Ikvvkekdpdvlityngdnfdfaylkkrseklgvkfi 

Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 

arysmedakvizyelgkef fpineaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresla 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkklcmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgf kvlya 

dtdg£fatipgadaetvkkkakefldyinaklpglleleyegfykrgf fvtkkkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaargiki 

rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 
pkt [SEQ ID NO. 31] 

>Tgo Y384H 

mildtdyitedgkpvirif kkengef kidydrnf epyiyallkddsaiedvkkitaerhgttvrwraekvkkkflg 
rpievwklyfthpqdvpairdkikehpavvdiyeydipfakrylidkglipmegdeelkmlafdiejilyhegeefae 
gpilmisyadeegarvitwknidlpyvdwstekemikrflkwkekdpdylityngdnfdfaylkkrseklgvkfi 
Igregsepkiqnngdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 
arysmedakvtyelgkef fpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresha 
ggyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 
vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgf kvlya 
dtdgf fatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 
eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaargiki 
rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 
pkt [SEQ ID NO. 32] 

>Tgo Y384Q 

mildtdyitedgkpvirif kkengef kidydrnf epyiyallkddsaiedvkkitaerhgttvrwraekvkkkflg 

rpievwklyfthpqdvpairdkikehpavvdiyeydipfakrylidkglipmegdeelknaafdietlyheg 

gpilmisyadeegarvitwknidlpyvdwstekemikrflkwkekdpdvlityngdnfdfaylkkrseklgvkfi 

Igregseplciqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 

arysmedakvtyelgkef fpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayemelapnkpderelarrresqa 

ggyvkeperglwenivyldf rslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkkkmkatidpiekklldyrqpraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgf kvlya 

dtdgffatlpgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyevppekiviyeqitrdlkdykatgphvavakrlaargiki 

rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 

pkt [SEQ ID NO. 33] 

>Tgo Y384S 

mildtdyitedgkpvirif kkengef kidydrnf epyiyallkddsaiedvkkitaerhgttvrwraekvkkkflg 
rpievwklyfthpqdvpairdkikehpawdiyeydipfakrylidkglipmegdeelkmlafdietlyhegeefae 
gpilmisyadeegarvitwknidlpyvdvvstekemikrflkvvkekdpdvlityngdnfdfaylkkrseklgvkfi 
Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 
arysmedakvtyelgkef fpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayemelapnkpderelarrressa 
ggyvkeperglwenivyldf rslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 
vkkkrakatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgf kvlya 
dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 
eiaketqarvleailkhgdveeavrivkevteldskyevppeklviyeqitrdlkdykatgphvavakrlaargiki 
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rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktravalaawl If 
pkt [SEQ ID NO. 34] h yj.yctv»j.K 

>Tgo G386S 

mildtdyitedgkpvirifkkengefkidydrnfepyiyallkddsaiedvkkitaerhgttvrwraekvkkkflcr 
rpxevwklyfthpqdvpairdkikehpawdiyeydipfakrylidkglipmegdeelkmlafdietlyheaeefae 
gpilmisyadeegarvitwknidlpyvdwstekemikrflkwkekdpdvlityngdnfdfaylkkrseklgvkfi 
IgregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytieavyeaifgqpkekvyaeeiaqawetgealerT 
arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresva 
sgyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerak 
vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfcrfkvlva 
dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 
eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaaraiki 

>Tgo G38 6P 

mildtdyitedgkpvirifkkengefkidydrnfepyiyallkddsaiedvkkitaerhgttvrvvraekvkkkfla 
rpievwklyfthpqdvpairdkikehpavvdiyeydipfakrylidkglipmegdeelkmlafdietlyheaeefae 
gpilnu.syadeegarvitwknidlpyvdwstekemikrflkwkekdpdvlityngdnfdfaylkkrseklavkfi 
Igregsepkiqxmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgealerv 
arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresva 
pgyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerak 
vkkkmkatxdpxekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiet?ireiee?fgfkJlSa 
dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittraleivrrdws 
eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaarcriki 

>Tgo G387A 

mildtdyitedgkpvirifkkengefkidydrnfepyiyallkddsaiedvkkitaerhgttvrvvraekvkkkfla 
rpievwklyfthpqdvpairdkikehpawdiyeydipfakrylidkglipmegdeelkmlafdletlyheqeefae 
gpilmisyadeegarvitwknidlpyvdwstekemikrflkwkekdpdvlityngdnfdfaylkkrseklovkfi 
Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeqlerv 
arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresva 
^u?!^®^®''^^*'®'^^^'^^''^^^®^^^^*^^^P*^"^^"^®5=®«y^^^P'J^ghkfckdfpgfipsllgdlleerCTk 
vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfqfkvlva 
dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 
eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaaraiki 

>Tgo G38 7P 

mlldtdyitedgkpvirifkkengefkidydrnfepyiyallkddsaiedvkkitaerhgttvrwraekvkklcflq 

rpievwklyfthpqdvpairdkikehpawdiyeydipfakrylidkgllpmegdeelJanlafdietlyhegeefae 

gpilmxsyadeegairvitwknidlpyvdwstekeinikrflkwkekdpdvlityngdnfdfaylkkrseklgvkfi 

Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv " 

arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresva 

gpyykeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkkkmkatidpiekklldyrqraikllansfygyygyakarwyckecaesvtawgrqyiettireieekfgfkvlya 

dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaarglkl 
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rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvalaaw 
pkt [SEQ ID NO. 38] jr jr-i m y yciwxit 

>KOD wild type 

mildtdyitedgkpvirifkkengefkieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 

rpvevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkmlafdiqtlyhegeefae 

gpilraisyadeegarvitwknvdlpyvdvvsteremikrflrvvkekdpdvlityngdnfdfaylkkrceklginfa 

Igrdgsepkigrmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk 

ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygfkviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrovolsawlk 
pkgt [SEQ ID NO. 39] - . H yxsawix 

>KOD Y384N 

mildtdyitedgkpvirifkkengefkieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 

rpvevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeeikmlafdiqtlyhegeefae 

gpilmisyadeegarvitwknvdlpyvdvvstereitd.krflrv-g'kekdpdvlityngdnfdfaylkkrceU 

Igrdgsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsne 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk 

ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygfkviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlxyaktrovalsaw 
pkgt tSEQ ID NO. 40] 

>KOD Y384L 

mildtdyitedgkpvirifkkengefkieydrtfepyfyallkddsaieevkkitaerhgtvvtvkrvekvqkkflg 

rpvevwklyfthpqdvpairdkirehgavidiyeydipfaJcrylidkglvpmegdeelknaafdiqtlyhege 

gpilmisyadeegarvitwknvdlpyvdvvsterend.krfli:vvkekdpdvlityngdnfdfaylkkrceklgi^ 

Igrdgsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsle 

ggyvkeperglwenivyldfrslypsiiitlinvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleer^ 

ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygfkviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws- 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvqlsawlk 
pkgt [SEQ ID NO. 41] 

>KOD Y384H 

mildtdyitedgkpvirifkkengefkieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 

rpyevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkmlafdiqtlyhegeefae 

gpiliaisyadeegarvitwknvdlpyvdvvstereiaikrflrvvkekdpdvlityngdnfdfaylkJcrceklginfa 

Igrdgsepkiqrmgdrfavevkgrilifdlypvirxtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysraedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnlqjdekelarrrqshe 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsl 

ikkkmkatidpierldldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygfkviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevtelclskyevppeklviheqitrdlkdykatgphvavaJcrlaargvki 
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rpgtvisyivlkgsgrigdraipfdefdptldikydaeyyienqvlpaverilrafgyrkedlryqktrqvglsa 
pkgt [SEQ ID NO. 42] 

>KOD Y384Q 

mildtdyitedgkpvirif kkengef kieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 

rpvevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkmlafdiqtlyhegeefae 

gpilmlsyadeegarvitwknvdlpyvdvvstereiuikrflrvvkekdpdvlityngdnfdfaylkkrcekl^^ 

Igrdgsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsqe 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk 

ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygf kviys 

dtdgffatipgadaetvkkkaineflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 43] 

>KOD Y384S 

mildtdyitedgkpvirif kkengef kieydrtfepyfyailkddsaieevkkitaerhgtvvtvkrvekvqkkflg 

rpvevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkmlafdiqtlyhegeefae 

gpilmisyadeegarvitwknvdlpyvdwsteremikrf Irv^kekdpdvlityngdnfdfaylkkrceklgin^ 

Igrdgsepkiqnngdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsse 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk 

ikkkidkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeleekygf kviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvJci 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 44] ' 

>KOD G386S 

mildtdyitedglqpvirif kkengef kieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 

rpvevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkmlafdiqtlyhegeefae 

gpilmisyadeegarvitwknvdlpyvdwsteremikrflrwkekdpdvlityngdnfdfaylkkrceklginfa 

Igrdgsepkiqnugdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 

sgyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvgtirfckdfpgfipsllgdlleerqk 

ikkkmkatidpierklldyrqraikilansyygyygyarar^fyckecaesvtawgreyitmtikeieekygf kviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakriaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 45] 

>KOD G386P 

mildtdyitedgkpvirif kkengef kieydrtfepyfyallkddsaieevkkitaerhgtvvtvkrvekvqkk^lg 
rpvevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkmlafdiqtlyhegeefae 
gpilmisyadeegarvitwknvdlpyvdwsteremikrflrwkekdpdvlityngdnfdfaylkkrceJclginfa 
Igrdgsepkiqnngdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 
arysmedaJcvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 
pgyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk 
ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygf kviys 
dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegJcittrgleivrrdws 
eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 
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rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsaw 
pkgt [SEQ ID NO. 4 6] 

>KOD G387A 

mildtdyitedgkpvirifkkengefkieydrtfepyfyallkddsaieevkkitaerhgtvvtvkrvekvqkkflg 

3TDyevwklyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkinlafdiqtlyhegeefae 

gpiliid.syadeegarvitwknvdlpyvdvvsteremikrflrvvkekdpdvlityngdnfdfaylkkrceklginfa 

Igrdgsepkiqirmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 

^arysraedakvtyelgkeflpmeaqisrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 

gayvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk * 

ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygf kviys 

dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 

eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 47] 

>KOD G387P 

mildtdyitedgkpvirif kkengefkieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 
rpvevT^klyfthpqdvpairdkirehgavidiyeydipfakrylidkglvpmegdeelkialafdiqtlyhegeefae 
gpilmisyadeegarvitwknvdlpyvdvvsteremikrflrvvkekdpdvlityngdnfdfaylkkrceklginfa 
Igrdgsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 
arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 
gpyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk' 
ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygfkviys 
dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 
eiaketqarvleallkdgdve kavrivke vt ekls kyevppe kl viheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvqlsawlk 
pkgt [SEQ ID NO, 48] 

>Vent wild type 

mildtdyitkdgkpiirifkkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 

revevwklifehpqdvpamrgkirehpavvdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 

geiimisyadeeearvitwknidlpyvdvvsneremikrfvqyvkekdpdviityngdnfdlpylik^ 

Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 

klaqysniedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 

tylggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 

rqdikkkmkstidpiekkmldyrqrailcllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgfkv 

lyadtdgfyatipgel<pelikkkakeflnyinsklpglleleyegfylrgffvtkkryavideegrittrglevvrr 

dwseiaketqakvleailkegsvekavevvrdvvekiakyrvpleJclviheqitrdlkdykaigphvaiakrlaarg 

ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskqtqlda 
wlkr [SEQ ID NO. 4 9] 

>Vent Y387N 

mildtdyitkdgkpiirifkkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 

reyevwklifehpqdvpaiargkirehpavvdiyeydipfa3crylidkglipmegdeelkllafdietfy 

geiimisyadeeearvitwJcnidlpyvdvvsneremikrfvqwkekdpdviityngdnfdlpyl 

Igrdkehpepkiqrmgdsfaveikgrihfdlfpvvrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 

klaqysmedaratyelgkeffpmeaelakligqsvwdvsrsstgnlvewyllrvayaimelapnJqpdeeeyJcrrlirt 

tnlggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 

rqdikk)aakstidpiekkmldyrqrai)cllansyygymgypkarwyskecaesvtawgrhyiemtireieek^ 

lyadtdgfyatipgekpelikkkakeflnyinsJdpglleleyegfylrgffvtkkryavideegrittrglewrr 

dwseiaketqakvleailkegsvekavevvrdvvekiakyrvplelclviheqitrdlkdykaigphvaialcrlaarg 
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ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskqtglda 
wlkr [SEQ ID NO. 50] 

>Vent Y387L 

mildtdyitkdgkpiirif kkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 
revevwklifehpqdvpamrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 
geiiiuisyadeeearvitwknidlpyvdwsnereraikrfvqwkekdpdviityngdnf dlpylikraeklgvrlv 
Igrdkehpepkiqrmgdsfaveikgrihfdlfpvvrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 
klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 
tllggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 
rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgf kv 
lyadtdgfyatipgekpelikkkakef Inyinsklpglleleyegf ylrgf fvtkkryavideegrittrglewrr 
dwseiaketqakvleailkegsvekavewrdwekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 
ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienc[vlpavlrileafgyrkedlryqsskqtglda 
wlkr [SEQ ID NO. 51] 

>Vent Y387H 

mildtdyitkdgkpiirif kkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 
revevwklifehpqdvpainrgkirehpavvdiyeydipfakrylidkglipinegdeelkllafdietfyhegdefgk 
ge i imis yadeee arvitwknidlpyvdv^sner emi kr f vqvvke kdpdviityngdbaf dlpy likra 
Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 
klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayamelapnkpdeeeykrrlrt 
thlggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 
rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekf gf kv 
lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegf ylrgf fvtkkryavideegrittrglewrr 
dwseiaketqakvleailkegsvekavewrdwekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 
ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskqtglda 
wlkr [SEQ ID NO, 52] 

>Vent Y387Q 

mildtdyitkdgkpiirif kkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 
revevwklifehpqdvpamrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 
geiimisyadeeearvitwknidlpyvdvvsneremikrfvqfvsrkekdpdviityngdnf dlpylikraeklgvrlv 
Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 
klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayamelapnkpdeeeykrrlrt 
tqlggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 
rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwys kecaesvtawgrhyiemtireieekf gf kv 
lyadtdgfyatipgekpelikkkakef Inyinsklpglleleyegf ylrgf fvtkkryavideegrittrglewrr 
dwseiaketqakvleailkegsvekavewrdwekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 
ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskqtglda 
wlkr [SEQ ID NO. 53] 

>Vent Y397S 

mildtdyitkdgkpiirif kkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavJcvrkkflg 
revevwklifehpqdvpamrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 
geiimisyadeeearvitwknidlpyvdwsneremilcrfvqwkekdpdviityngdnf dlpylikraeklgvrlv 
Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 
klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayamelapnkpdeeeykrrlrt 
tslggyvkepekglweniiyldfrspLypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 
rqdikklcmkstidpiekkmldyrqraikllansyygymgypkarwys kecaesvtawgrhyiemtireieekf gfkv 
lyadtdgfyatipgekpelikkkakef Inyinsklpglleleyegf ylrgf fvtkkryavideegrittrglewrr 
dwseiaketqakyleailkegsvekavewrdwekiakyrvpleklviheqitrdlkdykai'gphvaiakrlaarg 
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ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqssk<rtal 
wlkr [SEQ ID NO. 54] 

>Vent G38 9S 

mildtdyitkdgkpiirifkkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 

revevwklifehpqdvpamrgkirehpawdiyeydipf akrylidkglipmegdeelkllafdietfyhegdefgk 

geiimisyadeeearvitwknidlpyvdvvsneremikrfvqvvkekdpdviityngdnfdlpylikraeklgvrlv 

Igrdkehpepkiqrmgdsfaveikgrihfdlfpvvrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 

klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayamelapnkpdeeeykrrlrt 

tylsgyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 

rqdikkkmkstldpiekkmldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgfkv 

lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegfylrgffvtkkryavideegrittrglevvrr 

dwseiaketqakvleailkegsvekavewrdvvekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 

ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlrvQsskcrtalda 
wlkr [SEQ ID NO. 55] 

>Vent G38 9P 

mildtdyitkdglqpiirifkkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 
revevwklifehpqdvpainrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 
geiimisyadeeearvitwknidlpyvdwsneremikrfvqwkekdpdvlityngdnfdlpy 

Igrdkehpepkiqrmgdsfaveikgrihfdlfpvvrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 

klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 

tylpgyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 

rqdikkkmkstidpieklaaldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtlreieekfgfkv 

lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegfylrgffvtkkryavideegrittrglevvrr 

dwseiaketqakvleailkegsvekavewrdwekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 

ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskotalda 
wlkr [SEQ ID NO. 56] 

>Vent G390A 

mildtdyitkdgkpiirifkkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 

revevwklifehpqdvpaiiurgkirehpavvdiyeydipfakrylidkglipmegdeelkllafdietf^^ 

geiijaisyadeeearvitwknidlpyvdwsneremikrfvqvvkekdpdviityngdnfdlpylikraek 

Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 

klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 

tylgayvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgflpsilgdliam 

rqdikkkmkstidpiekknadyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtire 

lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegfylrgffvtkkryavideegrittrglewrr 

dwseiaketqakvleailkegsvekavewrdwekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 

ikv)q>gtiisyivlkgsgkisdrvillteydpr)dikydpdyyienqvlpavlrileafgyrkedlryqsskqtglda 
Vlkr [SEQ ID NO. 57] 

>Vent G390P 

mildtdyitkdgkpiirifkkengefkieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 

reyevwklifehpqdvpamrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 

geiiiuisyadeeearvitwknidlpyvdwsnerenLikrfvqwkekdpdvlityngdnfdlpylik^ 

Igrdkehpepkiqnagdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 

klaqysmedaratyelgkef fpmeaelalcligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt - 

tylgpyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 

rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgfkv 

lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegfylrgffvtkkryavideegrittrglewxr 

dwseiaketqakvleailkegsvekavewrdwekiakyrvpleldviheqitrdlkdykaigphvaialcriaarg 
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ikvkpgtiisyivlkgsgkisdrvillteydprldikydpdyyienqvlpavlrileafgyrkedlryqsskqtglda 
wlkr [SEQ ID NO. 58] 

>Deep Vent wild type 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyf ehpqdvpairdkirehsavidif eydipfakrylidkglipmegdeelkllafdietlyhegeef ak 
gpiimisyadeeeakvitwkkidlpyvevvssereraikrf Ikvirekdpdviityngdsfdlpylvkraeklgiklp 
Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresy 
«^ggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgf ipsllkrllderq 
eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekf gf kvly 
idtdglyatipgakpeeikkkalefvdyinaklpglleleyegf yvrgf f vtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 
vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 59] 

>Deep Vent Y385N 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyfehpqdvpairdkirehsavidif eydipfakrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidlpyvevvsseremikrflkvirekdpdviityngdsfdlpylvkraeklgiklp 
Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresn 
aggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 
eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalef vdyinaklpglleleyegf yvrgf f vtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailJdignveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 
vrpgiavigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 60] 

>Deep Vent Y385L 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyfehpqdvpairdkirehsavidif eydipf akrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidlpyvewsseremikrflkvirekdpdviityngdsfdlpylvkraeklgiklp 
Igrdgseplonqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresl 
aggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 
eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalef vdyinaklpglleleyegf yvrgf f vtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteldskyeippeklviyeqitrplheykaigphvavakrlaargvk 
vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk (SEQ ID NO. 61] 

>Deep Vent Y385H 

mildadyitedgkpiirif kkengef kveydrnfxpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyfehpqdvpairdkirehsavidif eydipf akrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidlpyvewsseremikrf Ikvirekdpdviityngds fdlpylvkraeklgiklp 
Igrdgseplonqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaglsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresh 
a^gyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipslllcrllderq 
eikrkmkaskdpiekJanldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalef vdyinaklpglleleyegf yvrgf f vtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitiTplheykaigphvavakrlaargvk 
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vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 62] 

>Deep Vent Y385Q 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkf Ig 
rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidlpyvewsseremikrf Ikvirekdpdviityngdsfdlpylvlcraeklgiklp 
Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresq 
aggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 
eikrkiakaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 
vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 63] 

>Deep Vent Y38 5S 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidipyvewsseremikrf Ikvirekdpdviityngdsfdlpylvkraeklgiklp 
Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaglsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlress 
aggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 
eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 
vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 64] 

>Deep Vent G387S 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyf ehpqdvpairdkirehsavidif eydipf akrylidkglipmegdeelkllafdietlyhegeefak 
gpiiinisyadeeeakvitwkkidlpyvewsseremikrflkvirekdpdviityngdsfdlpylvkraeklgiklp 
Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayemelapn)<pdereyerrlresy 
asgyvkepekglweglvsldfrslypsiiithnvspdJilnregcreydvapevghkfckdfpgfipsllJcrllderq 
eikrkrakaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgf fvtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 
vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 65] 

>Deep Vent G387P 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyf ehpqdvpairdkirehsavidif eydipf akrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeea)cvitwkkidlpyvewsseremi3crflkvirekdpdviityngdsfdlpylvkraeklg±klp 
Igrdgseplanqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayemelapnkpdereyerrlresy 
apgyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkf ckdfpgfipsllkrllderq 
eikr)aakaskdpiek3aaldyrqraikilansyygyygya]carwyckecaesvtawgreyiefvrkeleekfgf kvly 
idtdglyatipgakpeeikkkalefvdyinaldpglleleyegfyvrgf fvtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 
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vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwaktkatalt 
nikkk (SEQ ID NO. 66] m h y 

>Deep Vent G388A 

inildadyitedgkpiirifkkengefkveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 

rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 

gpiimisyadeeeakvitwkkidlpyvewsseremikrflkvirekdpdviityngdsfdlpylvkraeklgiklp 

Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 

akysiaedakvtyelgreffpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresv 

agayvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 

eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgfkvly 

idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffv±kkkyalideegkiitrgleivrrdw 

seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 

vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkotaltawl 
nikkk [SEQ ID NO. 67] ^ si«-y-i.u<iwi 

>Deep Vent G388P 

mildadyitedgkpiirifkkengefkveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 

rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 

gpiimisyadeeeakvitwkkidlpyvevvsseremikrflkvirekdpdviityngdsfdlpylvkxaeklgiklp 

Igrdgsepkmqrlgdintaveikgrihfdlyhvirxtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 

akysmedakvtyelgreffpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresy 

agpyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 

eikrkmkaskdpiekknadyrqraikilansyygyygyakarwyckecaesvtawgreylefvrkeleekfgfkvly 

xdtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgle±vrrdw 

seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 

vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
iidLkkk [SEQ ID NO* 68] 

Polymezrase Domain Mu'kant:s 

>Pfu D405E 

mildvdyiteegkpvirlfkkengkfkiehdrtfarpyiyallrddskieevkkitgerhgkivrivdvekvekkflg 

kpitvwklyiehpqdvptirekvrehpavvdifeydipfakrylidkglipmegeeelkilafdietlyhegeefgk 

gpiimisyadeneakvitwknidLLpyvevvsseremikrflriirekdpdiivtyngdsfdfpylakr^ 

igrdgsepkiaqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenler^ 

akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresy 

tggfvkepekglwenivylefralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 

kiktkroketqc^iekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvly 

idtdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitrgleivrrdw 

seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakgvk 

ikpgmvigyivlrgdgpisnrailaeeydpkkhkydaeyyienqvlpavlrilegfgyrkedlrvaktrovaltswl 
nikks [SEQ ID NO. 69] ^ 

>P.fu T542P 

mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgklvrivdvekvekkflg 

kpitvwklylehpqdvptirekvrehpav-v'difeydipfakrylidkglipmegeeelkila^^^ 

gpiiinisyadeneakvitwknidlpyvevvssereitiikrflriirekdpdiivtyngdsfdfpylak^ 

igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerv 

akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeeyqrrlresy 

tggfvkepekglwenivyldfralypsiiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleerq 

kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvly 

idpdglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitxgleivrrdw 
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seiaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakavk 

ikpgiavigyj.vlrgdgpisnrailaeeycipkkhkydaeyyienqvlpavlrilegfgyrkedlryqktr«jvaltswl 
nikks [SEQ ID NO. 70] yM'^i-i-«jvgxT:sw± 



>Pfu D54 3G 



mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgkivrivdvekvekkflo 
kpxtvwklylehpqdvptirekvrehpawdifeydipfakrylidkglipmegeeelkilafdietlyheqeefak 
gpiimisyadeneakvltwknidlpyvevvsseremikrflriirekdpdiivtyngdsfdfpylakraeklaiklt 
igrdgsepkmqrigdmtavevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadeiakawesgenlerJ 



akysmedakatyelgkeflpitieiqlsrlvgqplwdvsrsstgnlvewfllrkayernevapnkpseeevqrrlresv 
i?u5wu^! S^V^ff'^y^^^^^^i^^iiithnvspdtlnlegcknydiapqvghkfckdipgfipsllghlleera 



idtgglyatipggeseeikkkalefvkyinsklpglleleyegfykrgffvtkkryavideegkvitraleivrrdw 
sexaketqarvletilkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaigphvavakklaakavk 



>Pfu K593T 

mildvdyiteegkpvirlfkkengkfkiehdrtfrpyiyallrddskieevkkitgerhgklvxivdve^^^ 

kpitvwklylehpqdvptirekvrehpavvdifeydipfakrylidkglipmegeeelkilafdietlyhegeefak 
gpiimisyadeneakvitwknidlpyvewsseremikrflriirekdpdiivtyngdsfdfpylakrLklqi^^ 
xgrdgsepkmqrxgdmtayevkgrihfdlyhvitrtinlptytleavyeaifgkpkekvyadfLkawesgeServ 
akysmedakatyelgkeflpmeiqlsrlvgqplwdvsrsstgnlvewfllrkayemevapnkpseeeyq?rlresy 

kiktkmketqdpiekilldyrqkaikllansfygyygyakarwyckecaesvtawgrkyielvwkeleekfgfkvlv 
idtdglyatxpggeseeikkkalefvkyinsklpglleleyegfykrgffvtktryavideegkvitrgleivSdi 
sexaketqarvletxlkhgdveeavrivkeviqklanyeippeklaiyeqitrplheykaig^hvavaJklaakSvk 

>Tgo D404E 

mildtdyitedgkpvirifkkengefkidydrnfepyiyallkddsaiedvkkitaerhgttvrwraekvkkkfla 

rpievwklyfthpqdvpairdkikehpawdiyeydlpfakrylidkglipmegdeelkmlafdietlyhegeefae 

gpilinisyadeegaarvitwknidlpyvdvvstekeinikrflkvvkekdpdvlityngdnfdfaylkkrLklavkfl 

IgregsepkxqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeLqawetgeSS 

arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresva 

vkkkmkatxdpxekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgfkvlva 
dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 
eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaargiki 

>Tgo T541P 

mlldtdyitedgkpvirifkkengefkidydrnfepyiyallkddsaiedvkkitaerhgttvrwraekvkkkflq 
rpievwklyfthpqdvpairdkikehpawdiyeydipfakrylidkglipmegdeelkmlafdietlyhegeefae 
pilmisyadeegarvitwknidlpyvdvvstekemikrflkvvkekdpdvlityngdnfdfaylkkrseklavkfi 
Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 
arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresya 

vkkkmkatidpiekklldyrqraikilansfygyygyakatwyckecaesvtawgrqyiettireieekfgfkvlva 
dpdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 
eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqltrdlkdykatgphvavakrlaargiki 
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rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 
pkt tSEQ ID NO. 74] 

>Tgo D542G 

mildtdyitedgkpvirif kkengefkidydrnf epyiyallkddsaiedvkkitaerhgttvrvvraekvkkkflg 

rpievwklyfthpqdvpairdkikehpawdiyeydipfakrylidkglipmegdeelkmlafdietlyhegeefae 

gpilmisyadeegarvitwknidlpyvdvvstekemikrflkwkekdpdvlityngdnfdfaylkkrseklgvkfi 

Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 

arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrss.tgnlvewfllrkayernelapnkpderelarrresya 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettireieekfgf kvlya 

dtggf fatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtkkkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyevppeklviyeqitrdlkdykatgphvavakrlaargiki 

rpgtvisyivlkgsgrigdraipfdefdpakhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 
pkt [SEQ ID NO. 75] 

>Tgo K592T 

mildtdyitedgkpvirif kkengef kidydrnf epyiyallkddsaiedvkkitaerhgttvrvvraekvkkkflg 

rpieywklyfthpqdvpairdkikehpavvdiyeydipfakrylidkglipmegdeel)anlafdietlyhegeefae 

gpilmisyadeegaxvitwknidlpyvdvvstekemikrflkvvkekdpdvlityngdnfdfaylkkrseklgvkfi 

Igregsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeaifgqpkekvyaeeiaqawetgeglerv 

arysmedakvtyelgkeffpmeaqlsrlvgqslwdvsrsstgnlvewfllrkayernelapnkpderelarrresya 

ggyvkeperglwenivyldfrslypsiiithnvspdtlnregceeydvapqvghkfckdfpgfipsllgdlleerqk 

vkkkmkatidpiekklldyrqraikilansfygyygyakarwyckecaesvtawgrqyiettlreieekfgf kvlya 

dtdgffatipgadaetvkkkakefldyinaklpglleleyegfykrgffvtktkyavideedkittrgleivrrdws 

eiaketqarvleailkhgdveeavrivkevteklskyeyppeklviyeqitrdlkdykatgphvavakrlaargiki 

rpgtvisyivlkgsgrigdraipfdefdpalchkydaeyyienqvlpaverilrafgyrkedlryqktrqvglgawlk 
pkt [SEQ ID NO* 76] 

>KOD D4 04E 

mildtdyitedgkpvirif kkengef kieydrtfepyfyallkddsaieevkkitaerhgtwtvkrvekvqkkflg 
rpvevwklyfthpqdvpairdkirehgavidiyeydipf akrylidkglvpmegdeelkmlafdlqtlyhegeefae 
gpilmisyadeegarvitwknvdlpyvdvvsteremikrfirvvkekdpdvlityngdnfdfaylkkrceklginfa 
IgrdgsepJciqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavfgqpkekvyaeeitpawetgenlerv 
arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 
ggyvkeperglwenivylefrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgf ipsllgdlleerqk 
ikkkmkatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygf kviys 
dtdgffatipgadaetvkkkameflnyinaklpgaleleyegfykrgffvtkkkyavideegkittrgleivrrdws 
eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 

rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 77] 

>KOD T541P 

mldtdyitedgkpvirif kkengef kieydrtfepyfyallkddsaieevkkitaerhgtvvtvkrvekvqkkflg 
rpvevwklyfthpqdvpairdkirehgavidiyeydipfalcrylidkglvpmegdeellanlafdiqtlyhegeefae 
gpilnd.syadeegarvitwknvdlpyvdvvstereiaikrflrvvkekdpdvlityngdnfdfaylkkrceklginfa 
Igrdgsepkiqrmgdrfavevkgrihfdlypvirrtinlptyt^leavyeavfgqpkekvyaeeitpawetgenlerv 
arysmedalcvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewfllrkayernelapnkpdekelarrrqsye 
ggyvkeperglwerxivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgfipsllgdlleerqk 
ikkkiiikatidpierklldyrqraikilansyygyygyararwyckecaesvtawgreyitmtikeieekygf kviys 
dpdgffatipgadaetvk]dcameflnyinaklpgaleleyegfy)crgffvtkkkyavideeg]cittrgleivrrdws 
eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 
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rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 78] 

>KOD D54 2G 

mildtdyitedgkpvirif kkengef kieydrtf epyf yallkddsaieevkkitaerhgtvvtvkrvekvqkkf Ig 
rpvevwklyf thpqdvpairdkirehgavidiyeydipf akrylidkglvpmegdeelkmlaf diqtlyhegeef ae 
gpilmisyadeegarvitwknvdlpyvdwsteremikrf Irvvke kdpdvlityngdnf df aylkkrceklginf a 
Igrdgsepkiqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavf gqpkekvyaeeitpawetgenlerv 
arysmedakvtyelgkeflpiaeaqlsrligqslwdvsrsstgnlvewf llrkayernelapnkpdekelarrrqsye 
ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgf ipsllgdlleerqk 
ikkkmkatldpierklldyrqraikxlansyygyygyararwyckecaesvtawgreyitmtikeieekygf kviys 
dtggff atipgadaetvkkkamef Inyinaklpgaleleyegfykrgf fvtkkkyavideegkittrgleivrrdws 
eiaketqarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvkl 
rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 79] 

>KOD K592T 

ndldtdyitedgkpvirif kkengef kieydrtf epyf yallkddsaieevkkitaerhgtvvtvkrvekvqkkfig 
rpvevwkl yf thpqdvpairdkirehgavidiyeydipf akrylidkglvpmegdeelkmlaf diqtlyhegeef ae 
gpilmisyadeegarvitwknvdlpyvdwsteremikrflrvvkekdpdvlityngdnfdf aylkkrceklginf a 
Igrdgseplciqrmgdrfavevkgrihfdlypvirrtinlptytleavyeavf gqpkekvyaeeitpawetgenlerv 
arysmedakvtyelgkeflpmeaqlsrligqslwdvsrsstgnlvewf llrkayernelapnkpdekelarrrqsye 
ggyvkeperglwenivyldfrslypsiiithnvspdtlnregckeydvapqvghrfckdfpgf ipsllgdlleerqk 
ikkkmkatidpierklldyrqraikilansyyg^'Ygya^a^wyckecaesvtawgreyitmtikeieekygf kviys 
dtdgf fatxpgadaetvkkkameflnyinaklpgaleleyegfykrgf fvtktkyavideegkittrgleivrrdws 
eiaketqaarvleallkdgdvekavrivkevteklskyevppeklviheqitrdlkdykatgphvavakrlaargvki 
rpgtvisyivlkgsgrigdraipfdefdptkhkydaeyyienqvlpaverilrafgyrkedlryqktrqvglsawlk 
pkgt [SEQ ID NO. 80] 

>Vent D407E 

mildtdyitkdgkpiirifkkengef kieldphf qpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 
revevwldifehpqdvpamrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 
geiimisyadeeearvitwlmidlpyvdvvsneremikrfvqvvkekdpdviityngdnfdlpylikraeklgvrlv 
Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 
klaqysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 
tylggyvkepekglweniiylefrslypsiivtlinvspdtlekegc)cnydvapivgyrfckdfpgfipsilgdliam 
rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgf kv 
lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegfylrgffvtkkryavideegrittrglewrr 
dwseiaketqakvleailkegsvekayewrdwekiakyrvpleklviheqitrdlkdykaigphvaialcrlaarg 
ikvkpgtiisyivlkgsgkisdrviliteydprldikydpdyyienqvlpavlrileafgyrkedlryqsskcjtglda 
wlkr [SEQ ID NO. 81] 

>Vent T544P 

mildtdyitkdglcpiirif kkengef Jcieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 
revevwldifehpqdvpamrgkirehpawdiyeydipfakrylidkglipmegdeelkllafdietfyhegdef gk 
geiimisyadeeearvitwknidlpyvdvvsneremikrfvqvvkekdpdviityngdnfdlpylikraeklgvrlv 
Igrdkehpeplciqrmgdsfaveikgrihfdifpwrrtinlptytleavyeavlgktkslclgaeeiaaiweteesmk 
Iclaqysmedaratyelgkeffpmeaelakligqsvwdvsrsstgnlvewyllsrvayarnelapnkpdeeeykrrlrt 
tylggyvkepekglweniiyldfrslypsiivthnvspdtlekegclcnydvapivgyrfckdfpgfipsilgdliam 
rqdikkkmkstidpieklaaldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgf kv 
lyadpdgf yatipgekpelikkkakeflnyinsklpglleleyegf ylrgf fvtkkryavideegrittrglewrr 
dwseiaketqakvleailkegsvekavewrdweJciakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 
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ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlxyqsskqtglda 
wlkr [SEQ ID NO. 82] 

>Vent D545G 

mildtdyitkdgkpiirif kkengef kieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkf Ig 
revevwklifehpqdvpamrgkirehpavvdiyeydipfakrylidkglipmegdeelkllafdietfyhegdefgk 
geiimisyadeeearvitwknidlpyvdwsneremikrfVqwkekdpdviityngdnfdlpyiikraeklgvrlv 
Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytleavyeavlgktksklgaeeiaaiweteesmk 
klaqysmedaratyelgkeffpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 
tylggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 
rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwyskecaesvtawgrhyiemtireieekfgf kv 
lyadtggfyatipgekpelikkkakeflnyinsklpglleleyegfylrgf fvtkkjryavideegrittrglewrr 
dwseiaketqakvleailkegsve kavewrdwekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 

ikvkpgtiisyivlkgsgkisdrviliteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskqtglda 
wlkr [SEQ ID NO. 83] 

>Vent K595T 

mildtdyitkdgkpiirif kkengef kieldphfqpyiyallkddsaieeikaikgerhgktvrvldavkvrkkflg 

revevwklifehpqdvpamrgkirehpavvdiyeydipfakrylidkglipiaegdeelkllafdietfyhegdefgk 

geiimisyadeeearvitwknidlpyvdvvsneremikrfvqvvkekdt>dviityngdnfdlpylikraeklgvrlv 

Igrdkehpepkiqrmgdsfaveikgrihfdlfpwrrtinlptytlea'Vyeavlgktksklgaeeiaaiweteesmk 

klaqtysmedaratyelgkef fpmeaelakligqsvwdvsrsstgnlvewyllrvayarnelapnkpdeeeykrrlrt 

tylggyvkepekglweniiyldfrslypsiivthnvspdtlekegcknydvapivgyrfckdfpgfipsilgdliam 

rqdikkkmkstidpiekkmldyrqraikllansyygymgypkarwysk.ecaesvtawgrhyiemtireieekfgfkv 

lyadtdgfyatipgekpelikkkakeflnyinsklpglleleyegfylrgf fvtktryavideegrittrglevvrr 

dwseiaketqakvleailkegsvekavewrdvvekiakyrvpleklviheqitrdlkdykaigphvaiakrlaarg 

ikvkpgtiisyivlkgsgkisdrvillteydprkhkydpdyyienqvlpavlrileafgyrkedlryqsskqtglcia 
wlkr [SEQ ID NO. 84] 

>Deep Vent D405E 

mildadyitedgkpiirif kkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidlpyvewsseremikrflkvirekdpdviityngdsfdlpylvkraeklgiklp 
igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 
akysmedakvtyelgreffpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresy 
aggyvkepekglweglvslefrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 
ei3crkinkaskdpiekkinldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgfkvly 
idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgleivrrdw 
seiaketqalcvleailkhgnveeavldLvkevteklskyeippeklviyeqitrpliieykaigphvavalcrlaargvk 
vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwqktkqtgltawl 
nikkk [SEQ ID NO. 85] 

>Deep Vent T542P 

mildadyitedgkpiirif kkengef kveydrnfrpyiyailkddsqidevrkitaerhgkivriidaekvrkkflg 
rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 
gpiimisyadeeeakvitwkkidlpyvevvsseremikrflkvirekdpdviityngdsfdlpylvkraelclgiklp 
IgrdgsepkraqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgJqpkekvyaheiaeawetgkgierv 
akysmedakvtyelgref fpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnlcpdereyerrlresy 
ag^yvkepekglweglvsldfrslypsilithnvspdtlnregcreydvapevghkf ckdfpgfipsllkrllderq 
eikrkinkaskdpiekkialdyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgf kvly 
idpdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgleivrrdw 
seiaketqakvleailkhgnveeavkivkevteklskyeippeldviyeqitrplheykaigphvavalcrlaargvk 
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vrpgmvigyivlrgdgpiskrailaeefcarkhkydaeyyienqvlpavlrileafgyrkedlrwqktkataltawl 
nikkk [SEQ ID NO. 86] 

>Deep Vent D543G 

mildadyitedgkpiirifkkengefkveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 

rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipiaegdeelkllafdietlyhegeefak 

gpiimisyadeeeakvitwkkidlpyvevvsseremikrflkvirekdpdvlltyngdsfdlpylvkraeklgiklp 

Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 

akysroedakvtyelgreffpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresy 

aggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 

eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtiawgreyiefvrkeleekfgfkvly 

idtgglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgleivrrdw 

seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 

vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwQktkotaltawl 
nikkk [SEQ ID NO. 87] ^ ^ y 

>Deep Vent K593T 

mildadyitedgkpiirifkkengefkveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 

rpievwrlyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 

gpiimisyadeeeakvitwkkidlpyvewssereitiikrflkvirekdpdviityTigdsfdlpylv 

Igrdgsepkmqrlgdmtaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkglerv 

akysroedakvtyelgreffpmeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresy 

aggyvkepekglweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgfipsllkrllderq 

eikrkmkaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgfkvly 

idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtktkyalideegkiitrgleivrrdw 

seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 

vrpgmvigyivlxgdgpiskrailaeefdlrklikydaeyyienqvlpavlrileafgyrkedlrwaktkataltaw 
nikkk [SEQ ID NO, 88] ^ m y 
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Pfu DNA polymerase nucleotd.de sequence 

atgattttag atgtggatta cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaactt ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtttctcgg caagcctatt 240 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
gatatagaaa ccctctatca cgaaggagaa gagtttggaa aaggcccaat tataatgatt 480 
agttatgcag atgaaaatga agcaaaggtg attacttgga aaaacataga tcttccatac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagggag 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcattccc atatttagcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccaag 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaagaataca tttcgacttg 7 80 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtatatgaa 840 
gcaatttttg gaaagceaaa ggagaaggta tacgccgacg agatagcaaa agcctgggaa 900 
agtggagaga accttgagag agttgccaaa tactcgatgg aagatgcaaa ggcaacttat 960 
gaactcggga aagaattcct tccaatggaa attcagcttt caagattagrt tggacaacct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaagaagg 1140 
ctcagggaga gctacacagg tggattcgtt aaagagccag aaaaggggtt gtgggaaaac 1200 
atagtatacc tagattttag agccctatat ccctcgatta taattaccca caatgtttct 1260 
cccgatactc taaatcttga gggatgcaag aactatgata tcgctcctca agtaggccac 1320 
aagttctgca aggacatccc tggttttata ccaagtctct tgggacattt gttagaggaa 1380 
agacaaaaga ttaagacaaa aatgaaggaa actcaagatc ctatagaaaa aatactcctt 1440 
gactatagac aaaaagcgat aaaactctta gcaaattctt tctacggata ttatggctat 1500 
gcaaaagcaa gatggtactg taaggagtgt gctgagagcg ttactgcctg gggaagaaag 1560 
tacatcgagt tagtatggaa ggagctcgaa gaaaagtttg gatttaaagt cctctacatt 1620 
gacactgatg gtctctatgc aactatccca ggaggagaaa gtgaggaaat aaagaaaaag 1680 
gctctagaat ttgtaaaata cataaattca aagctccctg gactgctaga gcttgaatat 17 40 
gaagggtttt ataagagggg attcttcgtt acgaagaaga ggtatgcagt aatagatgaa 1800 
gaaggaaaag tcattactcg tggtttagag atagttagga gagattggag tgaaattgca 1860 
aaagaaactc aagctagagt tttggagaca atactaaaac acggagatgt tgaagaagct 1920 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgaaattcc accagagaag 1980 
ctcgcaatat atgagcagat aacaagacca ttacatgag± ataaggcgat aggtcctcac 2040 
gtagctgttg caaagaaact agctgctaaa ggagttaaaa taaagccagg aatggtaatt 2100 
ggatacatag tacttagagg cgatggtcca attagcaata gggcaattct agctgaggaa 2160 
tacgatccca aaaagcacaa gtatgacgca gaatattaca tggagaacca ggttcttcca 2220 
gcggtactta ggatattgga gggatttgga tacagaaagg aagacctcag ataccaaaag 2280 
acaagacaag tcggcctaac ttcctggctt aacattaaaa aatcctag 2328 [SEQ ID NO. 89] 

Pfu Y385N NNN=AAT, AAG (All possible N codons) 

Pfu Y385L 1SINN=TTA, TTG, CTT, CTC, CTA, CTG (All possible L codons) 
Pfu Y38 5H NNN= CAT, CAC {All possible H codons) 
Pfu Y385Q NN1>I= CAA, CAG (All possible Q codons) 

Pfu Y385S NNN« TCT, TCC, TCA, TCG, AGT, AGC (All possible S codons) 

atgattttag atgtggatta cataactgaa .gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaactt ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtttctcgg caagcctatt 240 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
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ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
laltltaTa cgaaggagaa gagtttggaa aaggcccaat tataltgatt 480 

olllTaltta tlf^^^^'"^^ agcaaaggtg attacttgga aaaacataga tcttccJtac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcaaaaaa 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcat??cc aS?t?agcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccK? 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaJgaSacI tttlgZltl Vqo 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtaJaSal 840 
gcaatttttg gaaagccaaa ggagaaggta tacgccgacg agatagcaS agcctggSa 900 
agtggagaga accttgagag agttgccaaa tactcgatgg aagatjcaaa g?caart?a? 96? 
gaactcggga aagaattcct tccaatggaa attcagcttt calga?tagt ??gac2cct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttcS aSJJJgJJa lolo 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaaoSJa llln 
ctcagggaga gcNNNacagg tggattcgtt aaagagcca| aaLgJgS? gtgggXJfc lloo 
atagtatacc tagattttag agccctatat ccctcgatta taatSccca llttatttnt 
ITJtlt"'"' taaatcttga gggatgcaag aacta?gata tcgctcctc: TgtlggToTo 132^ 
aagttctgca aggacatccc tggttttata ccaagtctct tgggacattt g?ta?aaaaa l^fln 
agacaaaaga ttaagacaaa aatgaaggaa actcaagatc c?I?agaaaa aatactcctt i4!o 
gactatagac aaaaagcgat aaaactctta gcaaattctt tctacggaJa ttaJSgcS^ Isto 
gcaaaagcaa gatggtactg taaggagtgt gctgagagcg ttactgcctg ggga^Sa^a llso 
tacatcgagt tagtatggaa ggagctcgaa gaaaagtttg gatttlaag? SJctJca?? ^620 

gSSaSa? gSaiS?: ^T^^ff^ ggaggagaaa JtgaggaaS SgaSag Itlo 
gctctagaat ttgtaaaata cataaattca aagctccctg gactgctaga acttoaatat Man 
gaagggtttt ataagagggg attcttcgtt acgaagaaga ggtajgcajt aSSJgIa J2oS 
gaaggaaaag tcattactcg tggtttagag atagttagga gagatiggjg JgaaK^S; Ileo 
aaagaaactc aagctagagt tttggagaca atactaaaac acjgagaig? JJaaaaaaS lltn 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgLSS; a^cagagSg islo 
ctcgcaatat atgagcagat aacaagacca ttacatgagt ataaggcgat agg?Sc?cac 2040 
gtagctgttg caaagaaact agctgctaaa ggagttaaaa taaagccagg aaJggJaS? lltn 
ggatacatag tacttagagg cgatggtcca attagcaata gggcaattc? aSc?S;aaa llso 
tacgatccca aaaagcacaa gtatgacgca gaatattdca JUagaacca gSJcScS llto 
gcggtactta ggatattgga gggatttgga tacagaaagg aJ^acctcag !?accaa!S 228^ 
acaagacaag tcggcctaac ttcctggctt aacattaaal aa?catag IsJs (sK ?D no 90] 

K« nZ' ^on' ''''''' possible S codons) 

Pfu G387P NNN= CCT, CCA, CCG, CCC (All possible P codons) 

atgattttag atgtggatta cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaactt ttlgaccati catJtacac? iSo 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taaJggggga aaggcaJJSa llo 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtt?c?Sg caaJSSlft ITo 
accgtgtgga aactttattt ggaacatccc caagatgtto ccactattig agalaJaS^ 3o2 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttg? Sag^gaSc 36^ 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagit tctJgccttc ito 

cgaaggagaa gagtttggaa aaggcccLt tataKgaJ? 480 
^ItJr^^l^ atgaaaatga agcaaaggtg attacttgga aaaacataga tcttccSac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagSao 600 
aaggatcctg acattatagt tacttataat . ggagactcat tcgcattccc ata?2f2^« tTn 
aaaagggcag aaaaacttgg gattaaatta IScattggaa gaga?ggSg cgagcccS? 72^ 
ta?^?.^^ taggcgatat gacggctgta gaagtcaagg gaagaaLc! tJSgScS? 78^ 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtatataaa 840 
gcaatttttg gaaagccaaa ggagaaggta tacgccgacg agatagcfL agcSJaSlal 900 
agtggagaga accttgagag agttgccaaa tactcgatgg algatgcaaa ggSSS? 96? 
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gaactcggga aagaattcct tccaatggaa attcagcttt caagattagt tggacaacct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaagaagg 1140 
ctcagggaga gctacacaNN Nggattcgtt aaagagccag aaaaggggtt gtgggaaaac 1200 
atagtatacc tagattttag agccctatat ccctcgatta taattaccca caatgtttct 1260 
cccgatactc taaatcttga gggatgcaa,g aactatgata tcgctcctca agtaggccac 1320 
aagttctgca aggacatccc tggttttata ccaagtctct tgggacattt gttagaggaa 1380 
agacaaaaga ttaagacaaa aatgaaggaa actcaagatc ctatagaaaa aatactcctt 1440 
gactatagac aaaaagcgat aaaactctta gcaaattctt tctacggata ttatggctat 1500 
gcaaaagcaa gatggtactg taaggagtgt gctgagagcg ttactgcctg gggaagaaag 1560 
tacatcgagt tagtatggaa ggagctcgaa gaaaagtttg gatttaaagt cctctacatt 1620 
gacactgatg gtctctatgc aactatccca ggaggagaaa gtgaggaaat aaagaaaaag 1680 
gctctagaat ttgtaaaata cataaattca aagctccctg gactgctaga gcttgaatat 1740 
gaagggtttt ataagagggg attcttcgtt acgaagaaga ggtatgcagt aatagatgaa 1800 
gaaggaaaag tcattactcg tggtttagag atagttagga gagattggag tgaaattgca 18 60 
aaagaaactc aagctagagt tttggagaca atactaaaac acggagatgt tgaagaagct 1920 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgaaattcc accagagaag 1980 
ctcgcaatat atgagcagat aacaagacca ttacatgagt ataaggcgat aggtcctcac 2040 
gtagctgttg caaagaaact agctgctaaa ggagttaaaa taaagccagg aatggtaatt 2100 
ggatacatag tacttagagg cgatggtcca attagcaata gggcaattct agctgaggaa 2160 
tacgatccca aaaagcacaa gtatgacgca gaatattaca tggagaacca ggttcttcca 2220 
gcggtactta ggatattgga gggatttgga tacagaaagg aagacctcag ataccaaaag 2280 
acaagacaag tcggcctaac ttcctggctt aacattaaaa aatcctag 2328 [SEQ ID NO. 91] 

Pfu G388A NNN= GCA, GCT, GCC^ GCG (All possible A codons) 
Pfu G388P NNN- CCT, CCA, CCG, CCC (All possible P codons) 

atgattttag atgtggatta cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaactt ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtttctcgg caagcctatt 240 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
gatatagaaa ccctctatca cgaaggagaa gagtttggaa aaggcccaat tataatgatt 480 
agttatgcag atgaaaatga agcaaaggtg attacttgga aaaacataga tcttccatac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagggag 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcattccc atatttagcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccaag 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaagaataca tttcgacttg 780 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtatatgaa 840 
gcaatttttg gaaagccaaa ggagaaggta tacgccgacg agatagcaaa agcctgggaa 900 
agtggagaga accttgagag agttgccaaa tactcgatgg aagatgcaaa ggcaacttat 960 
gaactcggga aagaattcct tccaatggaa attcagcttt caagattagt tggacaacct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaagaagg 1140 
ctcagggaga gctacacagg tNNNttcgtt aaagagccag aaaaggggtt gtgggaaaac 1200 
atagtatacc tagattttag agccctatat ccctcgatta taattaccca caatgtttct 1260 
cccgatactc taaatcttga gggatgcaag aactatgata tcgctcctca agtaggccac 1320 
aagttctgca aggacatccc tggttttata ccaagtctct tgggacattt gttagaggaa 1380 
agacaaaaga ttaagacaaa aatgaaggaa actcaagatc ctatagaaaa aatactcctt 1440 
gactatagac aaaaagcgat aaaactctta gcaaattctt tctacggata ttatggctat 1500 



27/57 



wo 03/060144 



PCT/US02/40423 



gcaaaagcaa gatggtactg taaggagtgt gctgagaigcg ttactgcctg gggaagaaag 
tacatcgagt tagtatggaa ggagctcgaa gaaaagtttg gatttaaagt cctctacatt 
gacactgatg gtctctatgc aactatccca ggaggagaaa gtgaggaaat aaagaaaaag 1680 
gctctagaat ttgtaaaata cataaattca aagctccctg gactgctaga gcttgaatat 1740 
gaagggtttt ataagagggg attcttcgtt acgaagaaga ggtatgcagt aatagatgaa 
gaaggaaaag tcattactcg tggtttagag atagttagga gagattggag tgaaattgca 
aaagaaactc aagctagagt tttggagaca atactaaaac acggagatgt tgaagaagct 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgaaattcc accagagaag 1980 
ctcgcaatat atgagcagat aacaagacca ttacatgagt ataaggcgat aggtcctcac 2040 
gtagctgttg caaagaaact agctgctaaa ggagttaaaa taaagccagg aatggtaatt 2100 
ggatacatag tacttagagg cgatggtcca attagca'ata gggcaattct agctgaggaa 2160 
tacgatccca aaaagcacaa gtatgacgca gaatattaca tggagaacca ggttcttcca 2220 
gcggtactta ggatattgga gggatttgga tacagaaagg aagacctcag ataccaaaag 2280 
acaagacaag tcggcctaac ttcctggctt aacattaaaa aatcctag 2328 [SKQ ID NO. 



1560 
1620 



1800 
1860 
1920 



92] 



Pfu D405E NNN= GAA, GAG (All possible E codons) 

atgattttag atgtggatta. cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaactt ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga iigtagagaag gttgagaaaa agtttctcgg caagcctatt 240 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
gatatagaaa ccctctatca cgaaggagaa gagtttggaa aaggcccaat tataatgatt 480 
agttatgcag atgaaaatga agcaaaggtg attacttgga aaaacataga tcttccatac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagggag 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcattccc atatttagcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccaag 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaagaataca tttcgacttg 780 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtatatgaa 840 
gcaatttttg gaaagccaaa ggagaaggta tacgccgacg agatagcaaa agcctgggaa 900 
agtggagaga accttgagag agttgccaaa tactcgatgg aagatgcaaa ggcaacttat 960 
gaactcggga aagaattcct tccaatggaa attcagcttt caagattagt tggacaacct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaagaagg 1140 
ctcagggaga gctacacagg tggattcgtt aaagagccag aaaaggggtt gtgggaaaac 1200 
atagtatacc taNNNtttag agccctatat ccctcgatta taattaccca caatgtttct 1260 
cccgatactc taaatcttga gggatgcaag aactatgata tcgctcctca agtaggccac 1320 
aagttctgca aggacatccc tggttttata ccaagtctct tgggacattt gttagaggaa 1380 
agacaaaaga ttaagacaaa aatgaaggaa actcaagatc ctatagaaaa aatactcctt 14 4 0 
gactatagac aaaaagcgat aaaactctta gcaaattctt tctacggata ttatggctat 1500 
gcaaaagcaa gatggtactg taaggagtgt gctgagagcg ttactgcctg gggaagaaag 1560 
tacatcgagt tagtatggaa ggagctcgaa gaaaagtttg gatttaaagt cctctacatt 1620 
gacactgatg gtctctatgc aactatccca ggaggagaaa gtgaggaaat aaagaaaaag 1680 
gctctagaat ttgtaaaata cataaattca aagctccctg gactgctaga gcttgaatat 1740 
gaagggtttt ataagagggg attcttcgtt acgaagaaga ggtatgcagt aatagatgaa 1800 
gaaggaaaag tcattactcg tggtttagag atagttagga gagattggag tgaaattgca 1860 
aaagaaactc aagctagagt tttggagaca atactaaaac acggagatgt tgaagaagct 1920 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgaaattcc accagagaag 1980 
ctcgcaatat atgagcagat aacaagacca ttacatgagt ataaggcgat aggtcctcac 2040 
gtagctgttg caaagaaact agctgctaaa ggagttaaaa taaagccagg aatggtaatt 2100 
ggatacatag tacttagagg cgatggtcca attagcaata gggcaattct agctgaggaa 2160 
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tacgatccca aaaagcacaa gtatgacgca gaatattaca tggagaacca ggttcttcca 2220 
gcggtactta .ggatattgga gggatttgga tacagaaagg aagacctcag ataccaaaag 2280 
acaagacaag tcggcctaac ttcctggctt aacattaaaa aatcctag 2328 [SEQ ID NO. 93] 

Pfu T542P NNN- CCT, CCA, CCG, CCC (All possible P codons) 
atgattttag atgtggatta cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gratagaactt ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtttctcgg caagcctatt 240 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
gatatagaaa ccctctatca cgaaggagaa gagtttggaa aaggcccaat tataatgatt 480 
agttatgcag atgaaaatga agcaaaggtg attacttgga aaaacataga tcttccatac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagggag 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcattccc atatttagcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccaag 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaagaataca tttcgacttg 780 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtatatgaa 840 
gcaa -tttg gaaagccaaa ggagaaggta tacgccgacg agatagcaaa agcctgggaa 900 
agtgaagaga accttgagag agttgccaaa tactcgatgg aagatgcaaa ggcaacttat 960 
gaacicggga aagaattcct tccaatggaa attcagcttt caagattagt tggacaacct 1020 
ttat -ggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcct. ;aaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaagaagg 1140 
ctci ?gaga gctacacagg tggattcgtt aaagagccag aaaaggggtt gtgggaaaac 1200 
atac atacc tagattttag agccctatat ccctcgatta taattaccca caatgtttct 1260 
cccg.iACtc taaatcttga gggatgcaag aactatgata tcgctcctca agtaggccac 1320 
aagt -^ca aggacatccc tggttttata ccaagtctct tgggacattt gttagaggaa 1380 
agac ^aaga ttaagacaaa aatgaaggaa actcaagatc ctatagaaaa aatactcctt 1440 
gact jagac aaaaagcgat aaaactctta gcaaattctt tctacggata ttatggctat 1500 
gcaa .^caa gatggtactg taaggagtgt gctgagagcg ttactgcctg gggaagaaag 1560 
tacc. lagt tagtatggaa ggagctcgaa gaaaagtttg gatttaaagt cctctacatt 1620 
gac!:. jatg gtctctatgc aactatccca ggaggagaaa gtgaggaaat aaagaaaaag 1680 
gctc-: .gaat ttgtaaaata cataaattca aagctccctg gactgctaga gcttgaatat 1740 
gaag^gtttt ataagagggg attcttcgtt acgaagaaga ggtatgcagt aatagatgaa 1800 
gaaggaaaag tcattactcg tggtttagag atagttagga gagattggag tgaaattgca 1860 
aaagaaactc aagctagagt tttggagaca atactaaaac acggagatgt tgaagaagct 1920 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgaaattcc accagagaag 1980 
ctcgcaatat atgagcagat aacaagacca ttacatgagt ataaggcgat aggtcctcac 2040 
gtagctgttg caaagaaact agctgctaaa ggagttaaaa taaagccagg aatggtaatt 2100 
ggatacatag tacttagagg cgatggtcca attagcaata gggcaattct agctgaggaa 2160 
tacgatccca aaaagcacaa gtatgacgca gaatattaca tggagaacca ggttcttcca 2220 
gcggtactta ggatattgga gggatttgga tacagaaagg aagacctcag ataccaaaag 2280 
acaagacaag tcggcctaac ttcctggctt aacattaaaa aatcctag 2328 (SEQ ID NO. 94] 

Pfu D543G NNN-GGT, GGC, GGA, GGG (All possible G codons) 
atgattttag atgtggatta cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaaett ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtttctcgg caagcctatt 240 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
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gatatagaaa ccctctatca cgaaggagaa gagtttggaa aaggcccaat tataatgatt 480 
agttatgcag atgaaaatga agcaaaggtg attacttgga aaaacataga . tcttccatac 54 0 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagggag 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcattccc atatttagcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccaag 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaagaataca tttcgacttg 780 
tatcatgtaa" taacaaggac aataaatctc ccaacataca cactagaggc tgtatatgaa 840 
gcaatttttg gaaagccaaa ggagaaggta tacgccgacg agatagcaaa agcctgggaa 900 
sigtggagaga accttgagag agttgccaaa tactcgatgg aagatgcaaa ggcaacttat 960 
gaactcggga aagaattcct tccaatggaa attcagcttt caagattagt tggacaacct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagtia tcaaagaagg 1140 
ctcagggaga gctacacagg tggattcgtt aaagagccag aaaaggggtt gtgggaaaac 1200 
atagtatacc tagattttag agccctatat ccctcgatta taattaccca caatgtttct 1260 
cccgatactc taaatcttga gggatgcaag aactatgata tcgctcctca agtaggccac 1320 
aagttctgca aggacatccc tggttttata ccaagtctct tgggacattt gttagaggaa 1380 
agacaaaaga ttaagacaaa aatgaaggaa actcaagatc ctatagaaaa aatactcctt 14 40 
gactatagac aaaaagcgat aaaactctta gcaaattctt tctacggata ttatggctat 1500 
gcaaaagcaa gatggtactg taaggagtgt gctgagagcg ttactgcctg gggaagaaag 1560 
tacatcgagt tagtatggaa ggagctcgaa gaaaagtttg gatttaaagt cctctacatt 1620 
gacactNNNg gtctctatgc aactatccca ggaggagaaa gtgaggaaat aaagaaaaag 1680 
gctctagaat ttgtaaaata cataaattca aagctccctg gactgctaga gcttgaatat 1740 * 
gaagggtttt ataagagggg attcttcgtt acgaagaaga ggtatgcagt aatagatgaa 1800 
gaaggaaaag tcattactcg tggtttagag atagttagga gagattggag tgaaattgca 1860 
aaagaaactc aagctagagt tttggagaca atactaaaac acggagatgt tgaagaagct 1920 
gtgagaatag taaaagaagt aatacaaaag cttgccaatt atgaaattcc accagagaag 1980 
ctcgcaatat atgagcagat aacaagacca ttacatgagt ataaggcgat aggtcctcac 2040 
gtagctgttg caaagaaacti agctgctaaa ggagttaaaa taaagccagg aatggtaatt 2100 
ggatacatag tacttagagg cgatggtcca attagcaata gggcaattct agctgaggaa 2160 
tacgatccca aaaagcaciaa gta-tgacgca gaatattaca tggagaacca ggttcttcca 2220 
gcggtactta ggatattgga gggatttgga tacagaaagg aagacctcag ataccaaaag 2280 
acaagacaag tcggcctaac ttcctggctt aacatliaaaa aatcctag 2328 [SEQ ID NO. 95] 

Pfu K593T NNN=ACT, ACC, ACA, ACG (All possible T codons) 
atgattttag atgtggatta cataactgaa gaaggaaaac ctgttattag gctattcaaa 60 
aaagagaacg gaaaatttaa gatagagcat gatagaactt ttagaccata catttacgct 120 
cttctcaggg atgattcaaa gattgaagaa gttaagaaaa taacggggga aaggcatgga 180 
aagattgtga gaattgttga tgtagagaag gttgagaaaa agtttctcgg caagcctatt 240 * 
accgtgtgga aactttattt ggaacatccc caagatgttc ccactattag agaaaaagtt 300 
agagaacatc cagcagttgt ggacatcttc gaatacgata ttccatttgc aaagagatac 360 
ctcatcgaca aaggcctaat accaatggag ggggaagaag agctaaagat tcttgccttc 420 
gatatagaaa ccctctatca cgaaggagaa gagtttggaa aaggcccaat tataatgatt 480 
agttatgcag atgaaaatga agcaaaggtg attacttgga aaaacataga tcttccatac 540 
gttgaggttg tatcaagcga gagagagatg ataaagagat ttctcaggat tatcagggag 600 
aaggatcctg acattatagt tacttataat ggagactcat tcgcattccc atatttagcg 660 
aaaagggcag aaaaacttgg gattaaatta accattggaa gagatggaag cgagcccaag . 720 
atgcagagaa taggcgatat gacggctgta gaagtcaagg gaagaataca tttcgacttg 780 
tatcatgtaa taacaaggac aataaatctc ccaacataca cactagaggc tgtatatgaa 840 
gca'atttttg gaaagccaaa ggagaaggta tacgccgacg agatagcaaa agcctgggaa 900 
agtggagaga accttgagag agttgccaaa tactcgatgg aagatgcaaa ggcaacttat 960 
gaactcggga aagaattcct tccaatggaa attcagcttt caagattagt tggacaacct 1020 
ttatgggatg tttcaaggtc aagcacaggg aaccttgtag agtggttctt acttaggaaa 1080 
gcctacgaaa gaaacgaagt agctccaaac aagccaagtg aagaggagta tcaaagaagg 1140 
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ctcagggaga 

atagtatacc 

cccgatactc 

aagttctgca 

aga'caaaaga 

gactatagac 

gcaaaagcaa 

tacatcgagt 

gacactgatg 

gctctagaat 

gaagggtttt 

gaaggaaaag 

aaagaaactc 

gtgagaatag 

ctcgcaatat 

gtagctgttg 

ggatacatag 

tacgatccca 

gcggtactta 

acaagacaag 



gctacacagg 

tagattttag 

taaatcttga 

aggacatccc 

ttaagacaaa 

aaaaagcgat 

gatggtactg 

tagtatggaa 

gtctctatgc 

ttgtaaaata 

ataagagggg 

tcattactcg 

aagctagagt 

taaaagaagt 

atgagcagat 

caaagaaact 

tacttagagg 

aaaagcacaa 

ggatattgga 

tcggcctaac 



tggattcgtt 

agccctatat 

gggatgcaag 

tggttttata 

aatgaaggaa 

aaaactctta 

taaggagtgt 

ggagctcgaa 

aactatccca 

cataaattca 

attcttcgtt 

tggtttagag 

tttggagaca 

aatacaaaag 

aacafagacca 

agctgctaaa 

cgatggtcca 

gtatgacgca 

gggatttgga 

ttcctggctt 



aaagagccag aaaaggggtt gtgggaaaac 1200 
ccctcgatta taattaccca caatgtttct 1260 
aactatgata tcgctcctca agtaggccac 1320 
ccaagtctct tgggacattt gttagaggaa 1380 
actcaagatc ctatagaaaa aatactcctt 1440 
gcaaattctt tctacggata ttatggctat 1500 
gctgagagcg ttactgcctg gggaagaaag 1560 
gaaaagtttg gatttaaagt cctctacatt 1620 
ggaggagaaa gtgaggaaat aaagaaaaag 1680 
aagctccctg gactgctaga gcttgaatat 1740 
acgaagNNNa ggtatgcagt aatagatgaa 1800 
atagttagga gagattggag tgaaattgca 1860 
atactaaaac acggagatgt tgaagaagct 1920 
cttgccaatt atgaaattcc accagagaag 1980 
ttacatgagt ataaggcgat aggtcctcac 2040 
ggagttaaaa taaagccagg aatggtaatt: 2100 
attagcaata gggcaattct agctgaggaa 2160 
gaatattaca tggagaacca ggttcttcca 2220 
tacagaaagg aagacctcag ataccaaaag 2280 
aacattaaaa aatcctag 2328. (SEQ ID NO. 



96] 
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KOD DKA polymerase wild type 

atgatcctcg acactgacta cataaccgag* gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 120 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg gagaccagtt 240 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 4 20 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 4 80 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 54 0 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc ctatctgaaa 660 
aagcgctgtg aaaagctcgg aataaacttc gccctcggaa gggatggaag cgagccgaag 720 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca cttcgatctc 7 80 
tatcctgtga taagacggac gataaacctg cccacataca cgcttgaggc cgtttatgaa 840 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac agcctgggaa 900 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa ggtcacatac 960 
gagcttggga aggagttcct tccgatggag gcccagcttt ctcgcttaat cggccagtcc 1020 
ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct cctcaggaag 1080 
gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct ggccagaaga 1140 
cggcagagct atgaaggagg ctatgtaaaa gagcccgaga gagggttgtg ggagaacata 1200 
gtgtacctag attttagatc cctgtacccc tcaatcatca tcacccacaa cgtctcgccg 1260 
gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt cggccaccgc 1320 
ttctgcaagg acttcccagg atttatcccg agcctgcttg gagacctcct agaggagagg 1380 
cagaagataa agaagaagat gaaggccacg attgacccga tcgagaggaa gctcctcgat 14 40 
tacaggcaga gggccatcaa gatcctggca aacagctact acggttacta cggctatgca 1500 
agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg aagggagtac 1560 
ataacgatga ccatcaagga gatagaggaa aagtacggct ttaaggtaat ctacagcgac 1620 
accgacggat tttttgccac aatacctgga gccgatgctg aaaccgtcaa aaagaaggct 1680 
atggagttcc tcaagtatat caacgccaaa cttccgggcg cgcttgagct cgagtacgag 1740 
ggcttctaca aacgcggctt cttcgtcacg aagaagaagt atgcggtgat agacgaggaa 1800 
ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg actggagcga gatagcgaaa 1860 
gagacgcagg cgagggttct tgaagctttg ctaaaggacg gtgacgtcga gaaggccgtg 1920 
aggatagtca aagaagttac cgaaaagctg agcaagt^cg aggttccgcc ggagaagctg 1980 
gtgatccacg agcagataac gagggattta aaggactaca aggcaaccgg tccccacgtt 2040 
gccgttgcca agaggttggc cgcgagagga gtcaaaatac gccctggaac ggtgataagc 2100 
tacatcgtgc tcaagggctc tgggaggata ggcgacaggg cgataccgtt cgacgagttc 2160 
gacccgacga agcacaagta cgacgccgag tactacattg agaaccaggt tctcccagcc 2220 
gttgagagaa ttctgagagc cttcggttac cgcaaggaag acctgcgcta ccaga^gacg 2280 
agacaggttg gtttgagtgc ttggctgaag ccgaagggaa cttga 2325 [SEQ ID NO. 97] ' 

KOD Y384N NNN=AAT, AAC (All possil^le N codpns) 

KOD y384L NNN=TTA, TTG^ CTT, CTC, CTA, CTG (All possible I. codons) 
KOD y384H NNN== CAT, CAC (All possi±>le H codons) 
KDD Y384Q NNN= GAA, CAG (All possible Q codons) 

KOD Y384S NNN= TCT, TCC, TCA, a?CG, AGO?, AGC (All possible S codons) 
atgatcctcg acactgacta cataaccgag gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 120 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg gagaccagtt 240 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
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ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 4 20 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 480 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 540 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc ctatctgaaa 660 
aagcgctgtg aaaagctcgg aataaacttc gccctcggaa gggatggaag cgagccgaag 720 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca cttcgatctc 780 
tatcctgtga taagacggac gataaacctg cccacataca cgcttgaggc cgtttatgaa 840 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac agcctgggaa 900 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa ggtcacatac 960 
gagcttggga aggagttcct tccgatggag gcccagcttt ctcgcttaat cggccagtcc 1020 
ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct cctcaggaag 1080 
gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct ggccagaaga 1140 
cggcagagcN imgaaggagg ctatgtaaaa gagcccgaga gagggttgtg ggagaacata 1200 
gtgtacctag attttagatc cctgtacccc tcaatcatca tcacccacaa cgtctcgccg 1260 
gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt cggccaccgc 1320 
ttctgcaagg acttcccagg atttatcccg agcctgcttg gagacctcct agaggagagg 1380 
cagaagataa agaagaagat gaaggccacg attgacccga tcgagaggaa gctcctcgat 1440 
tacaggcaga gggccatcaa gatcctggca aacagctact acggttacta cggctatgca 1500 
agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg aagggagtac 1560 
ataacgatga ccatcaagga gatagaggaa aagtacggct ttaaggtaat ctacagcgac 1620 
accgacggat tttttgccac aatacctgga gccgatgctg aaaccgtcaa aaagaaggct 1680 
atggagttcc tcaagtatat caacgccaaa cttccgggcg cgcttgagct cgagtacgag 1740 
ggcttctaca aacgcggctt cttcgtcacg aagaagaagt atgcggtgat agacgaggaa 1800 
ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg actggagcga gatagcgaaa 1860 
gagacgcagg cgagggttct tgaagctttg ctaaaggacg gtgacgtcga gaaggccgtg 1920 
aggatagtca aagaagttac cgaaaagctg agcaagtacg aggttccgcc ggagaagctg 1980 
gtgatccacg agcagataac gagggattta aaggactaca aggcaaccgg tccccacgtt 2040 
gccgttgcca agaggttggc cgcgagagga gtcaaaatac gccctggaac ggtgataagc 2100 
tacatcgtgc tcaagggctc tgggaggata ggcgacaggg cgataccgtt cgacgagttc 2160 
gacccgacga agcacaagta cgacgccgag tactacattg agaaccaggt tctcccagcc 2220 
gttgagagaa ttctgagagc cttcggttac cgcaaggaag acctgcgcta ccagaagacg 2280 
agacaggttg gtttgagtgc ttggctgaag ccgaagggaa cttga 2325 [SEQ ID NO. 98] 

KOD G386S NNN= TCT, TCC, TCA, TCG, AGT, AGC (AIX possible S codons) 
KOD G386P NNN= CCT, CCA, CCG, CCC (All pbssible P codons) 
atgatcctcg acactgacta cataaccgag gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 120 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg gagaccagtt 240 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 420 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 4 80 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 540 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc ctatctgaaa 660 
aagcgctgtg aaaagctcgg aataaacttc gccctcggaa gggatggaag cgagccgaag 720 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca cttcgatctc 780 
tatcctgtga taagacggac gataaacctg cccacataca cgcttgaggc cgtttatgaa 840 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac agcctgggaa 900 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa ggtcacatac 960 
gagcttggga aggagttcct tccgatggag gcccagcttt ctcgcttaat" cggccagtcc 1020 
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ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct cctcaggaag 1080 
gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct ggccagaaga 1140 
cggcagagct atgaaNNNgg ctatgtaaaa gagcccgaga gagggttgtg ggagaacata 1200 
gtgtacctag attttagatc cctgtacccc tcaatcatca tcacccacaa cgtctcgccg 1260 
gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt cggccaccgc 1320 
ttctgcaagg acttcccagg' atttatcccg agcctgcttg gagacctcct agaggagagg 1380 
cagaagataa agaagaagat gaaggccacg attgacccga tcgagaggaa gctcctcgat 1440 
tacaggcaga gggccatc.aa gatcctggca aacagctact acggttacta cggctatgca 1500 
agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg aagggagtac 1560 
ataacgatga ccatcaagga gatagaggaa aagtacggct ttaaggtaat ctacagcgac 1620 
accgacggat tttttgccac aatacctgga gccgatgctg aaaccgtcaa aaagaaggct 1680 
atggagttcc tcaagtatat caacgccaaa cttccgggcg cgcttgagct cgagtacgag 1740 
ggcttctaca aacgcggctt cttcgtcacg aagaagaagt atgcggtgat agacgaggaa 1800 
ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg actggagcga gatagcgaaa 1860 
gagacgcagg cgagggttct tgaagctttg ctaaaggacg gtgacgtcga gaaggccgtg 1920 
aggatagtca aagaagttac cgaaaagctg agcaagtacg aggttccgcc ggagaagctg 1980 
gtgatccacg agcagataac gagggattta aaggactaca aggcaaccgg -tccccacgtt 2040 
gccgttgcca agaggttggc cgcgagagga gtcaaaatac gccctggaac ggtgataagc 2100 
tacatcgtgc tcaagggctc tgggaggata ggcgacaggg cgataccgtt cgacgagttc 2160 
gacccgacga agcacaagta cgacgccgag tactacattg agaaccaggt tctcccagcc 2220 
gttgagagaa ttctgagagc cttcggttac cgcaaggaag acctgcgcta ccagaagacg 2280 
agacaggttg gtttgagtgc ttggctgaag ccgaagggaa cttga 2325 [SEQ ID NO. 99] 

KOD G387A NNN= GCA, GCT, GCC, GCG (All possible A codons) 
KOD G387P NNN= CCT, CCA^ CCG, CCC (All possible P codons) 
atgatcctcg acactgacta cataaccgag gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 120 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg gagaccagtt 240 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 420 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 480 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 540 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact: tcgacttcgc ctatctgaaa 660 
aagcgctgtg aaaagctcgg aataaacttc gccctcggaa gggatggaag cgagccgaag 720 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca cttcgatctc 780 
tatcctgtga taagacggac gataaacctg cccacataca cgcttgaggc cgtttatgaa 840 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac agcctgggaa 900 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa ggtcacatac 960 
gagcttggga aggagttcct tccgatggag gcccagcttt ctcgcttaat cggccagtcc 1020 
ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct cctcaggaag 1080 
gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct ggccagaaga 1140 
cggcagagct atgaaggaKN Ktatgtaaaa gagcccgaga gagggttgtg ggagaacata 1200 
gtgtacctag attttagatc cctgtacccc tcaatcatca tcacccacaa cgtctcgccg 1260 
gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt cggccaccgc 1320 
ttctgcaagg acttcccagg atttatcccg agcctgcttg gagacctcct agaggagagg 1380 
cagaagataa agaagaagat gaaggccacg attgacccga tcgagaggaa gctcctcgat 1440 
tacaggcaga gggccatcaa gatcctggca aacagctact acggt.tacta cggctatgca 1500 
agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg aagggagtac 1560 
ataacgatga ccatcaagga gatagaggaa aagtacggct ttaaggtaat ctacagcgac 1620 
accgacggat tttttgccac aatacctgga gccgatgctg aaaccgtcaa aaagaaggct 1680 
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atggagttcc 
ggcttctaca 
ggcaagataa 
gagacgcagg 
aggatagtca 
gtgatccacg 
gccgttgcca 
tacatcgtgc 
gacccgacga 
gttgagagaa 
agacaggttg 



tcaagtatat 
aacgcggctt 
caacgcgcgg 
cgagggttct 
aagaagttac 
agcagataac 
agaggttggc 
tcaagggctc 
agcacaagta 
ttctgagagc 
gtttgagtgc 



caacgccaaa 
cttcgtcacg 
acttgagatt 
tgaagctttg 
cgaaaagctg 
gagggattta 
cgcgagagga 
tgggaggata 
cgacgccgag 
cttcggttac 
ttggctgaag 



cttccgggcg 
aagaagaagt 
gtgaggcgtg 
ctaaaggacg 
agcaagtacg 
aaggactaca 
gtcaaaatac 
ggcgacaggg 
tactacattg 
cgcaaggaag 
ccgaagggaa 



cgcttgagct 
atgcggtgat 
actggagcga 
gtgacgtcga 
aggttccgcc 
aggcaaccgg 
gccctggaac 
cgataccgtt 
agaaccaggt 
acctgcgcta 
cttga 2325 



cgagtacgag 1740 
agacgaggaa 1800 
gatagcgaaa 18 60 
gaaggccgtg 1920 
ggagaagctg 1980 
tccccacgtt 2040 
ggtgataagc 2100 
cgacgagttc 2160 
tctcccagcc 2220 
ccagaagacg 2280 
[SEQ ID NO. 100] 



KOD D404E NNN= GAA, GAG (All possi±>le E codons) 
atgabcctcg acactgacta cataaccgag gatggaaagc ctgtcataag 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat: 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc 
aagcgctgtg aaaagctcgg aataaacttc" gccctcggaa gggatggaag 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca 
tatcctgtga taagacggac gataaacctg cccacataca cgcttgaggc 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa 
gagcttggga aggagttcct tccgatggag gcccagcttt ctcgcttaat 
ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct 
gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct 
cggcagagct atgaaggagg ctatgtaaaa gagcccgaga gagggttgtg 
gtgtacctaN NNtttagatc cctgtacccc tcaatcatca tcacccacaa 
gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt 
ttctgcaagg acttcccagg atttatcccg agcctgcttg gagacctcct 
cagaagataa agaagaagat gaaggccacg attgacccga tcgagaggaa 
tacaggcaga gggccatcaa gatcctggca aacagctact acggttacta 
agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg 
ataacgatga ccatcaagga gatagaggaa aagtacggct ttaaggtaat 
accgacggat tttttgccac aatacctgga gccgatigctg aaaccgtcaa 
atggagttcc tcaagtatat caacgccaaa cttccgggcg cgcttgagct 
ggcttctaca aacgcggctt cttcgtcacg aagaagaagt atgcggtgat 
ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg actggagcga 
gagacgcagg cgagggttct tgaagctttg ctaaaggacg gtgacgtcga 
aggatagtca aagaagttac cgaaaagctg agcaagtacg aggttccgcc 
gtgatccacg agcagataac gagggattta aaggactaca aggcaaccgg 
gccgttgcca agaggttggc cgcgagagga gtcaaaatac gccctggaac 
tacatcgtgc tcaagggctc tgggaggata ggcgacaggg cgataccgtt 
gacccgacga agcacaagta cgacgccgag tactacattg agaaccaggt 
gttgagagaa ttctgagagc cttcggttac cgcaaggaag acctgcgcta 
agacaggttg gtttgagtgc ttggctgaag ccgaagggaa cttga 2325 



aattttcaag 60 
cttctacgcc 120 
gaggcacggg 180 
gagaccagtt 240 
ggacaagata 300 
caagcgctac 360 
gctcgccttc 420 
ccttatgata 480 
tctcccctac 540 
tgtgaaggag 600 
ctatctgaaa 660 
cgagccgaag 720 
cttcgatctc 780 
cgtttatgaa 840 
agcctgggaa 900 
ggtcacatac 960 
cggccagtcc 1020 
cctcaggaag 1080 
ggccagaaga 114 0 
ggagaacata 1200 
cgtctcgccg 1260 
cggccaccgc 1320 
agaggagagg 1380 
gctcctcgat 1440 
cggctatgca 1500 
aagggagtac 1560 
ctacagcgac 1620 
aaagaaggct 1680 
cgagtacgag 1740 
agacgaggaa 1800 
gatagcgaaa 1860 
gaaggccgtg 1920 
ggagaagctg 1980 
tccccacgtt 2040 
ggtgataagc 2100 
cgacgagttc 2160 
tctcccagcc 2220 
ccagaagacg 2280 
[SEQ ID NO. 101] 
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KOD T541P NNN= CCT, CCA, CCG, CCC (All possible P codons) 
atgatcctcg acactgacta cataaccgag gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 120 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg gagaccagtt 240 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 420 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 4 80 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 540 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc ctatctgaaa 660 
aagcgctgtg aaaagctcgg aataaacttc gccctcggaa gggatggaag cgagccgaag 720 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca cttcgatctc 780 
tatcc-tgtga taagacggac gataaacctg cccacataca cgcttgaggc cgtttatgaa 840 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac agcctgggaa 900 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa ggtcacatac 960 
gagci:tggga aggagttcct tccgatggag gcccagcttt ctcgcttaat cggccagtcc 1020 
ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct cctcaggaag 1080 
gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct ggccagaaga 1140 
cggcagagct atgaaggagg ctatgtaaaa gagcccgaga gagggttgtg ggagaacata 1200 
gtgtacctag attttagatc cctgtacccc tcaatcatca tcacccacaa cgtctcgccg 1260 
gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt cggccaccgc 1320 
ttctgcaagg acttcccagg atttatcccg agcctgcttg gagacctcct agaggagagg 1380 
cagaagataa agaagaagat gaaggccacg attgaccega tcgagaggaa gctcctcgat 1440 
tacaggcaga gggccatcaa gatcctggca aacagctact acggttacta cggctatgca 1500 
agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg aagggagtac 1560 
ataacgatga ccatcaagga ga1:agaggaa aagtacggct ttaaggtaat ctacagcgac 1620 
NNNgacggat tttttgccac aatacctgga gccgatgctg aaaccgtcaa aaagaaggct 1680 
atggagttcc tcaagtatat caacgccaaa cttccgggcg cgcttgagct cgagtacgag 1740 
ggcttctaca aacgcggctt cttcgtcacg aagaagaagt atgcggtgat agacgaggaa 1800 
ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg actggagcga gatagcgaaa 1860 
gagacgcagg cgagggttct tgaagctttg ctaaaggacg gtgacgtcga gaaggccgtg 1920 
aggatagtca aagaagttac cgaaaagctg agcaagtacg aggttccgcc ggagaagctg 1980 
gtgatccacg agcagataac gagggattta aaggactaca aggcaaccgg tccccacgtt: 2040 
gccgttgcca agaggttggc cgcgagagga gtcaaaatac gccctggaac ggtgataagc 2100 
tacatcgtgc tcaagggctc tgggaggata ggcgacaggg cgataccgtt cgacgagttc 2160 
gacccgacga agcacaagta cgacgccgag tactacattg agaaccaggt tctcccagcc 2220 
gttgagagaa ttctgagagc cttcggttac cgcaaggaag acctgcgcta ccagaagacg 2280 
agacaggttg gtttgagtgc ttggctgaag ccgaagggaa cttga 2325 [SEQ ID NO. 102) 

KOD D542G NNN^GGT, GGC, GGA, GGG (All pos^sible G codons) 
atgatcctcg acactgacta cataaccgag gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 120 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agtt<?ctcgg gagaccagtt 240 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 420 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 480 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 540 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc ctatctgaaa 660 
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aagcgctgtg 

attcagagga 

tatcctgtga 

gccgtcttcg 

accggcgaga 

gagcttggga 

ctctgggacg 

gcctatgaga 

cggcagagct 

gtgtacctag. 

gatacgctca 

ttctgcaagg 

cagaagataa 

tacaggcaga' 

agggcgcgct 

ataacgatga 

accNNNggat 

atggagttcc 

ggcttctaca 

ggcaagaliaa 

gagacgcagg 

aggatagtca 

gtgatccacg 

gccgttgcca 

tacatcgtgc 

gacccgacga 

gttgagagaa 

agacaggttg 



aaaagctcgg 
tgggcgacag 
taagacggac 
gtcagccgaa 
accttgagag 
aggagttcct 
tctcccgctc 
ggaatgagct 
atgaaggagg 
attttagatc 
acagagaagg 
acttcccagg 
agaagaagat 
gggccatcaa 
ggtactgcaa 
ccatcaagga 
tttttgccac 
tcaagtatat 
aacgcggctt 
caacgcgcgg 
cgagggttct 
aagaagttac 
agcagataac 
agaggttggc 
tcaagggctc 
agcacaag1:a 
ttctgagagc 
gtttgagtgc 



aataaacttc 
gtttgccgtc 
gataaacctg 
ggagaaggtt 
agtcgcccgc 
tccgatggag 
cagcactggc 
ggccccgaac 
ctatgtaaaa 
cctgtacccc 
atgcaaggaa 
atttatccGg 
gaaggccacg 
gatcctggca 
ggagtgtgca 
gatagaggaa 
aatacctgga 
caacgccaaa 
cttcgtcacg 
acttgagatt 
tgaagctttg 
cgaaaagctg 
gagggattta 
cgcgagagga 
tgggaggata 
cgacgccgag 
cttcggttac 
ttggctgaag 



KOD K592T 1<INN'=ACT, ACC, ACA, ACG 
atgatcctcg acactgacta cataaccgag 
aaggaaaacg gcgagtttaa gattgagtac 
ctcctgaagg acgattctgc cattgaggaa 
acggttgtaa cggttaagcg ggttgaaaag 
gaggtctgga aactctactt tactcatccg 
cgagagcatc cagcagttat tgacatctac 
ctcatagaca agggattagt gccaatggaa 
gacattgaaa ctctctacca tgagggcgag 
agctacgccg acgaggaagg ggccaggglig 
gttgacgtcg tctcgacgga gagggagatg 
aaagacccgg. acgttctcat aacctacaac 
aagcgctgtg aaaagctcgg aataaacttc 
attcagagga tgggcgacag gtttgccgtc 
tatcctgtga taagacggac gataaacctg 
gccgtcttcg gtcagccgaa ggagaaggtt 
accggcgaga accttgagag agtcgcccgc 
gagcttggga aggagttcct tccgatggag 
ctctgggacg tctcccgctc cagcactggc 
gcctatgaga ggaatgagct ggccccgaac 
cggcagagct atgaaggagg ctatgtaaaa 
gtgtacctag attttagatc cctgtacccc 
gatacgctca acagagaagg atgcaaggaa 
ttctgcaagg acttcccagg atttatcccg 



gccctcggaa gggatggaag cgagccgaag 720 
gaagtgaagg gacggataca cttcgatctc 780 
cccacataca cgcttgaggc cgtttatgaa 840 
tacgctgagg aaataaccac agcctgggaa 900 
tactcgatgg aagatgcgaa ggtcacatac 960 
gcccagcttt ctcgcttaat cggccagtcc 1020 
aacctcgttg agtggttcct cctcaggaag 1080 
aagcccgatg aaaaggagct ggccagaaga 1140 
gagcccgaga gagggttgtg ggagaacata 1200 
tcaatcatca tcacccacaa cgtctcgccg 1260 
tatgacgttg ccccacaggt cggccaccgc 1320 
agcctgcttg gagacctcct agaggagagg 1380 
attgacccga tcgagaggaa gctcctcgat 144 0 
aacagctact acggttacta cggctatgca 1500 
gagagcgtaa cggcctgggg aagggagtac 1560 
aagtacggct ttaaggtaat ctacagcgac 1620 
gccgatgctg aaaccgtcaa aaagaaggct 1680 
cttccgggcg cgcttgagct cgagtacgag 1740 
aagaagaagt atgcggtgat agacgaggaa 1800 
gtgaggcgtg actggagcga gatagcgaaa 1860 
ctaaaggacg gtgacgtcga gaaggccgtg 1920 
agcaagtacg aggttccgcc ggagaagctg 1980 
aaggactaca aggcaaccgg tccccacgtt 2040 
gtcaaaatac gccctggaac ggtgataagc 2100 
ggcgacaggg cgataccgtt cgacgagttc 2160 
tactacattg agaaccaggt tctcccagcc 2220 
cgcaaggaag acctgcgcta ccagaagacg 2280 
ccgaagggaa cttga 2325 [SEQ ID NO. 103] 

(All possible T codoixs) 

gatggaaagc ctgtcataag aattttcaag 60 

gaccggactt ttgaacccta cttctacgcc 120 

gtcaagaaga taaccgccga gaggcacggg 180 

gttcagaaga agttcctcgg gagaccagtt 240 

caggacgtcc cagcgataag ggacaagata 300 

gagtacgaca tacccttcgc caagcgctac 360 

ggcgacgagg agctgaaaat gctcgccttc 420 

gagttcgccg aggggccaat ccttatgata 4 80 

ataacttgga agaacgtgga tctcccctac 540 

ataaagcgct tcctccgtgt tgtgaaggag 600 

ggcgacaact tcgacttcgc ctatctgaaa 660 

gccctcggaa gggatggaag cgagccgaag 720 

gaagtgaagg gacggataca cttcgatctc 780 

cccacataca cgcttgaggc cgtttatgaa 840 

tacgctgagg aaataaccac agcctgggaa 900 

tactcgatgg aagatgcgaa ggtcacatac 960 

gcccagcttt ctcgcttaat cggccagtcc 1020 

aacctcgttg agtggttcct cctcaggaag 1080 

aagcccgatg aaaaggagct ggccagaaga 1140 

gagcccgaga gagggttgtg ggagaacata 1200 

tcaatcatca tcacccacaa cgtctcgccg 1260 

tatgacgttg ccccacaggt cggccaccgc 1320 

agcctgcttg gagacctcct agaggagagg 1380 
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cagaagataa 
tacaggcaga 
agggcgcgct 
ataacgatga 
accgacggat 
atggagttcc 
ggcttctaca 
ggcaagataa 
gagacgcagg 
aggatagtca 
gtgatccacg 
gccgttgcca 
tacatcgtgc 
gacccgacga 
gttgagagaa 
agacaggttg 



agaagaagat 
gggccatcaa 
ggtactgcaa 
ccatcaagga 
tttttgccac 
tcaagtatat 
aacgcggctt 
caacgcgcgg 
cgagggttct 
aagaagttac 
agcagataac 
agaggttggc 
tcaagggctc 
agcacaagta 
ttctgagagc 
gtttgagtgc 



gaaggccacg 
gatcctggca 
ggagtgtgca 
gatagaggaa 
aatacctgga 
caacgccaaa 
cttcgtcacg 
acttgagatt 
tgaagctttg 
cgaaaagclig 
gagggattta 
cgcgagagga 
tgggaggata 
cgacgccgag 
cttcggtt'ac 
ttggctgaag 



attgacccga 
aacagctact 
gagagcgtaa 
aagtacggct 
gccgatgctg 
cttccgggcg 
aagNNNaagt 
gtgaggcgrtg 
ctaaaggacg 
agcaagtacg 
aaggactaca 
gtcaaaaiiac 
ggcgacaggg 
tactacattg 
cgcaaggaag 
ccgaagggaa 



tcgagaggaa 
acggttacta 
cggcctgggg 
ttaaggtaat 
aaaccgtcaa 
cgcttgagct 
atgcggtgat 
actggagcga 
gtgacgtcga 
aggttccgcc 
aggcaaccgg 
gccctggaac 
cgataccgtt 
agaaccaggt 
acctgcgcta 
cttga 2325 



gctcctcgat 14 40 
cggctatgca 1500 
aagggagtac 15 60 
ctacagcgac 1620 
aaagaaggct 1680 
cgagtacgag 17 40 
agacgaggaa 1800 
gatagcgaaa 18 60 
gaaggccgtg 1920 
ggagaagctg 1980 
tccccacgtt 2040 
ggtgataagc 2100 
cgacgagttc 2160 
tctcccagcc 2220 
ccagaagacg 2280 
ISEQ ID NO. 104] 
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Vent DNA polymerase wild type 

atgatactgg, acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
aaagagaacg gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatat-gct 120 
cttctcaaag atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
aaaactgtga gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 240 
gaagtctgga agctcatttt cgagcatccc caagacgttc cagctatgcg gggcaaaata 300 
agggaacatc cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
ctcatagaca agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 420 
gatattgaaa cgttttatca tgagggagat gaatttggaa agggcgagat aataatgatt 480 
agttatgccg atgaagaaga ggccagagta atcacatgga aaaatatcga t:ttgccgtat 540 
gtcgatgttg tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 600 
aaagaccccg atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 660 
aaacgggcag aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
cccaagattc agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 780 
gatcttttcc cagttgtgcg aaggacgata aacctcccaa cgtatacgct tgaggcagtt 840 
tatgaagcag 1:tt:taggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 900 
tgggaaacag aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 960 
acgtatgagc tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 1020 
caaagtgtat gggacgtctc gagatcaagc accggcaacc tcgtggagtg gtatctttta 1080 
agggtggcat acgcgaggaa tgaacttgca ccgaacaaac ctgatgagga agagtataaa , 1140 
cggcgcttaa gaacaactta cctgggagga tatgtaaaag agccagaaaa aggtttgtgg 1200 
gaaaatatca tttatttgga tttccgcagt ctgtaccctt caataatagt tactcacaac 1260 
gtatccccag atacccttga aaaagagggc tgtaagaatt acgatgttgc tccgatagta 1320 
ggatataggt tctgcaagga ctttccgggc tttattccct ccatactcgg ggacttaatt 1380 
gcaatgaggc aagatataaa gaagaaaatg aaatccacaa ttgacccgat cgaaaagaaa 1440 
atgctcgatt ataggcaaag ggctattaaa ttgcttgcaa acagctatta cggctatatg 1500 
gggtatccta aggcaagatg gtactcgaag gaatgtgctg aaagcgttac cgcatggggg 1560 
agacactaca tagagatgac gataagagaa atagaggaaa agttcggctt: taaggttctt 1620 
tatgcggaca ctgacggctt ttatgccaca atacccgggg aaaagcctga actcattaaa 1680 
aagaaagcca aggaattcct aaactacata aactccaaac ttccaggtct gcttgagctt 1740 
gagtatgagg gcttttactt gagaggattc tttgttacaa aaaagcgcta tgcagtcata 1800 
gatgaagagg gcaggataac aacaaggggc ttggaagtag taaggagaga ttggagtgag 1860 
atagctaagg agactcaggc aaaggtttta gaggctatac ttaaagaggg aagtgttgaa 1920 
aaagctgtag aagttgttag agatgttgta gagaaaatag caaaatacag ggttccactt 1980 
gaaaagcttg ttatccatga gcagattacc agggatttaa aggactacaa agccattggc 2040 
cctcatgtcg cgatagcaaa aagacttgcc gcaagaggga taaaagtgaa accgggcaca 2100 
ataataagct atatcgttct caaagggagc ggaaagataa gcgatagggt aattttactt 2160 
acagaatacg atcctagaaa acacaagtac gatccggact actacataga aaaccaagtt 2220 
ttgccggcag tacttaggat actcgaagcg tttggataca gaaaggagga tttaaggtat 2280 
caaagctcaa- aacaaaccgg cttagatgca tggctcaaga ggtag 2325 [SEQ ID NO. 105] 

Vent Y387N NNN^AAT, AAC (All possi±>le N codons) 

Vent Y387L NNN=TTA/ TTG^ CTT, CTC, CTA, CTG (All possible I. codons). 
Vent y387H HNN~ CAT, CAC (All possible H codons) 
Vent 7387Q iqHNs CAA, GAG (All possible Q codons) 

Vent Y387S NNI^ TCT, TCC, TGA, TCG, A6T, AGC (All possible S oodozis) 



atgatactgg acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 

aaagagaacg gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 

cttctcaaag atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 

aaaactgtga gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 240 

gaagtctgga agctcatttt cgagcatccc caagacgttc cagctatgcg gggcaaaata 300 

agggaacatc cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
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ctcatagaca 
gatattgaaa 
agttatgccg 
gtcgatgttg 
aaagaccccg 
aaacgggcag 
cccaagattc 
gatcttttcc 
tatgaagcag 
tgggaaacag 
acgtatgagc 
caaagtgtat 
agggtggcat 
cggcgcttaa 
gaaaatatca 
gtatccccag 
ggatataggt 
gcaatgaggc 
atgctcgatt 
gggrtatccta 
agacactaca 
tat:gcggaca 
aagaaagcca 
gagtatgagg 
gatgaagagg 
atagctaagg 
aaagctgtag 
gaaaagcbtg 
cctcatgtcg 
atiaataagct 
acagaatacg 
ttgccggcag 
caaagctcaa 

Vent G389S 
Vent G389P 
atgatactgg 
aaagagaacg 
cttctcaaag 
aaaactgtga 
gaagtctgga 
^gggaacatc 
ctcatagaca 
gatattgaaa 
agttatgccg 
gtcgatgttg 
aaagaccccg 
aaacgggcag 
cccaagattc 
gatcttttcc 
tatgaagcag 
tgggaaacag 
acgtatgagc 



agggcttgat 
cgttttatca 
atgaagaaga 
tgtccaatga 
atgtgataat 
aaaagctggg 
agaggatggg 
cagttgtgcg 
ttttaggaaa 
aagaaagcat 
tcgggaagga 
gggacgtctc 
acgcgaggaa 
gaacaactNN 
tttatttgga 
atacccttga 
tctgcaagga 
aagatataaa 
ataggcaaag 
aggcaagatg 
tagagatgac 
ctgacggctt 
aggaattcct 
gcttttactt 
gcaggataac 
agactcaggc 
aagttgttag 
ttatccatga 
cgatagcaaa 
atatcgttct 
atcctagaaa 
tacttaggat 
aacaaaccgg 



tcccatggag 
tgagggagat 
ggccagagta 
aagagaaatg 
aacttacaat 
agttcggctt 
tgatagtttt 
aaggacgata 
aaccaaaagc 
gaaaaaacta 
attcttcccc 
gagatcaagc 
tgaacttgca 
Nctgggagga 
tttccgcagt 
aaaagagggc 
ctttccgggc 
gaagaaaatg 
ggctattaaa 
gtactcgaag 
gataagagaa 
ttatgccaca 
aaactacata 
gagaggattc 
aacaaggggc 
aaaggtttta 
agatgttgta 
gcagattacc 
aagacttgcc 
caaagggagc 
acacaagtac 
actcgaagcg 
cttagatgca 



ggagacgagg 
gaatttggaa 
atcacatgga 
ataaagcgtt 
ggggacaatt 
gtcttaggaa 
gctgtggaaa 
aacctcccaa 
aaattaggag 
gcccagtact 
atggaagctg 
accggcaacc 
ccgaacaaac 
tatgtaaaag 
ctgtaccctt 
tgtaagaatt 
tttattccct 
aaatccacaa 
ttgcttgcaa 
gaatgtgctg 
atagaggaaa 
atacccgggg 
aactccaaac 
tttgttacaa 
ttggaagtag 
gaggctatac 
gagaaaatag 
agggatttaa 
gcaagaggga 
ggaaagataa 
gatccggact 
tttggataca 
tggctcaaga 



agcttaagct 
agggcgagat 
aaaatatcga 
ttgttcaagt 
ttgatttgcc 
gggacaaaga 
tcaagggtag 
cgtatacgct 
cagaggaaat 
caatggaaga 
agctggcaaa 
tcgtggagtg 
ctgatgagga 
agccagaaaa 
caataatagt 
acgatgttgc 
ccatactcgg 
ttgacccgat 
acagctatta 
aaagcgttac 
agttcggctt 
aaaagcctga 
ttccaggtct 
aaaagcgcta 
taaggagaga 
ttaaagaggg 
caaaatacag 
aggactacaa 
taaaagtgaa 
gcgatagggt 
actacataga 
gaaaggagga 
ggtag 2325 



ccttgccttt 420 
aataatgatt 480 
tttgccgtat 54 0 
tgttaaagaa 600 
gtatctcata 660 
acatcccgaa 720 
aatccacttt 780 
tgaggcagtt 840 
tgccgctata 900 
tgctagggca 960 
gctgataggt 1020 
gtatctttta 1080 
agagtataaa 1140 
aggtttgtgg 1200 
tactcacaac 1260 
tccgatagta 1320 
ggacttaatt 1380 
cgaaaagaaa 1440 
cggctatatg 1500 
cgcatggggg 15 60 
taaggttctt 1620 
actcattaaa 1680 
gcttgagctt 1740 
tgcagtcata 1800 
ttggagtgag 18 60 
aagtgttgaa 1920 
ggttccactt 1980 
agccattggc 2040 
accgggcaca 2100 
aattttactt 2160 
aaaccaagtt 2220 
tttaaggtat 2280 
[SEQ ID NO. 106] 



NNN=s TOT, TCC,. TCA, TCG, AGT, AGO (All possdLble S codons) 
NNN= CCT, CCA^ CCG^ CCC (All possible P codons) 
acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 
atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
•gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 2 40 
agctcatttt cgagjcatccc caagacgttc cagctatgcg gggcaaaata 
cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 
agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 
cgttttatca tgagggagat gaatttggaa agggcgagat aataatgatt 
atgaagaaga ggccagagta atcacatgga aaaatatcga tttgccgtat 
tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 
atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 
aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 780 
cagttgtgcg aaggacgata aacctcccaa cgtatacgct tgaggcagtt 
ttttaggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 
aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 
tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 



300 
360 
420 
480 
540 
600 
660 



840 
900 
960 
1020 
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caaagtgtat 
agggtggcat 
cggcgcttaa 
gaaaatatca 
gtatccccag 
ggatataggt 
gcaatgaggc 
atgctcgatt 
gggtatccta 
agacactaca 
tatgcggaca 
aagaaagcca 
gagtatgagg 
gatgaagagg 
atagctaagg 
aaagctgtag 
gaaaagcttg 
cctcatgtcg 
ataataagct 
acagaatacg 
ttgccggcag 
caaagctcaa 

Vent G390A 
Vent G39.0P 
atgatactgg 
aaagagaacg 
cttctcaaag 
aaaactgtga 
gaagtctgga 
agggaacatc 
ctcatagaca 
gatattgaaa 
agttatgccg 
gtcgatgttg 
aaagaccccg 
aaacgggcag 
cccaagattc 
gatcttttcc 
tatgaagcag 
tgggaaacag 
acgtatgagc 
caaagtgtat 
agggtggcat 
cggcgcttaa 
gaaaatatca 
gtatccccag 
ggatataggt 
gcaatgaggc 
atgctcgatt 
gggtatccta 
agacactaca 
tatgcggaca 



gggacgtctc 
acgcgaggaa 
gaacaactta 
tttatttgga 
atacccttga 
tctgcaagga 
aagatataaa 
ataggcaaag 
aggcaagatg 
tagagatgac 
ctgacggctt 
aggaattcct 
gcttttactt 
gcaggataac 
agactcaggc 
aagttgttag 
ttatccatga 
cgatagcaaa 
atatcgttct 
atcctagaaa 
tacttaggat 
aacaaaccgg 



gagatcaagc 
tgaacttgca 
cctgNNNgga 
tttccgcagt 
aaaagagggc 
ctttccgggc 
gaagaaaatg 
ggctattaaa 
gtactcgaag 
gataagagaa 
ttatgccaca 
aaactacata 
gagaggattc 
aacaaggggc 
aaaggtttta 
agatgttgta 
gcagattacc 
aagacttgcc 
caaagggagc 
acacaagtac 
actcgaagcg 
cttagatgca 



accggcaacc 
ccgaacaaac 
tatgtaaaag 
ctgtaccctt 
tgtaagaatt 
tttattccct 
aaatccacaa 
ttgcttgcaa 
gaatgtgctg 
atagaggaaa 
atacccgggg 
aactccaaac 
tttgttacaa 
ttggaagtag 
gaggctatac 
gagaaaatag 
agggatttaa 
gcaagaggga 
ggaaagataa 
gatccggact 
tttggataca 
tggctcaaga 



tcgtggagtg 
ctgatgagga 
agccagaaaa 
caataatagt 
acgatgttgc 
ccatactcgg 
ttgacccgat 
acagctatta 
aaagcgttac 
agttcggctt 
aaaagcctga 
ttccaggtct 
aaaagcgcta 
taaggagaga 
ttaaagaggg 
caaaatacag 
aggactacaa 
taaaagtgaa 
gcgatagggt 
actacataga 
gaaaggagga 
ggtag 2325 



gtatctttta 1080 
agagtataaa 1140 
aggtttgtgg 1200 
tactcacaac 1260 
tccgatagta 1320 
ggacttaatt 1380 
cgaaaagaaa 1440 
cggctatatg 1500 
cgcatggggg 1560 
taaggttctt 1620 
actcattaaa 1680 
gcttgagctt 1740 
tgcagtcata 1800 
ttggagtgag 18 60 
aagtgttgaa 1920 
ggttccactt 1980 
agccattggc 2040 
accgggcaca 210.0 
aattttactt 2160 
aaaccaagtt 2220 
tttaaggtat 2280 
[SEQ ID NO, 107] 



NNN- GCA, GOT, GCC, GCG (All possible A codons) 
NNN= CCT^ CCA, CCG, CCC (All possible P codons) 
acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 
atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagrtt 240 
agctcatttt cgagcatccc caagacgttc cagctatgcg gggcaaaata 300 
cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 420 
cgttttatca tgagggagat gaatttggaa agggcgagat aataatgatt 480 
atgaagaaga ggccagagta atcacatgga aaaatatcga tttgccgtat 540 
tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 600 
atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 660 , 
aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 7 80 
cagttgtgcg aaggacgata aacctcccaa cgtatacgct tgaggcagtt 840 
ttttaggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 900 
aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 960 
tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 1020 
gggacgtctc gagatcaagc accggcaacc tcgtggagtg gtatctttta 1080 
acgcgaggaa tgaacttgca ccgaacaaac ctgatgagga agagtataaa 1140 
gaacaactta cctgggaNNN tatgtaaaag agccagaaaa aggtttgtgg 1200 
tttatttgga tttccgcagt ctgtaccctt caataatagt tactcacaac 1260 
atacccttga aaaagagggc tgtaagaatt acgatgttgc tccgatagta 1320 
tctgcaagga ctttccgggc tttattccct ccatactcgg ggacttaatt 1380 
aagatataaa gaagaaaatg aaatccacaa ttgacccgat cgaaaagaaa 1440 
ataggcaaag ggctattaaa ttgcttgcaa acagctatta cggctatatg 1500 
aggcaagatg gtactcgaag gaatgtgctg aaagcgttac cgcatggggg 1560 
tagagatgac gataagagaa atagaggaaa agttcggctt taaggttctt 1620 
ctgacggctt ttatgccaca atacccgggg aaaagcctga actcattaaa 1680 
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aagaaagcca aggaattcct aaactacata aactccaaac ttccaggtct gcttgagctt 1740 
gagtatgagg gcttttactt gagaggattc tttgttacaa aaaagcgcta tgcagtcata 1800 
gatgaagagg gcaggataac aacaaggggc ttggaagtag taaggagaga ttggagtgag 18 60 
atagctaagg agactcaggc aaaggtttta gaggctatac ttaaagaggg aagtgttgaa .1920 
aaagctgtag aagttgttag agatgttgta gagaaaatag caaaatacag ggttccactt 1980 
gaaaagcttg ttatccatga gcagattacc agggatttaa aggactacaa agccattggc 2040 
cctcatgtcg cgatagcaaa aagacttgcc gcaagaggga taaaagtgaa accgggcaca 2100 
ataataagct atatcgttct caaagggagc ggaaagataa gcgatagggrt aattttactt 2160 
acagaatacg atcctagaaa acacaagtac gatccggact actacataga aaaccaagtt 2220 
ttgccggcag tacttaggat actcgaagcg tttggataca gaaaggagga tttaaggtat 2280 
caaagctcaa aacaaaccgg cttagatgca tggctcaaga ggtag 2325 [SEQ ID NO, 108) 

Vent D407E 1SINN= GAA, GAG (All possible E codons) 

atgatactgg acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
aaagagaacg gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 
cttctcaaag atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
aaaactgtga gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 240 
gaagtctgga agctcatttt cgagcatccc caagacgttc cagctatgcg gggcaaaata 300 
agggaacatc cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
ctcatagaca agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 420 
gatattgaaa cgttttatca tgagggagat gaattiiggaa agggcgagat aataatgatt 4 80 
agttatgccg atgaagaaga ggccagagta atcacatgga aaaatatcga tttgccgtat 540 
gtcgatgttg tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 600 
aaagaccccg atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 660 
aaacgggcag aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
cccaagattc agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 780 
gatcttttcc cagttgtgcg aaggacgata aacctcccaa cgtatacgct tgaggcagtt 840 
tatgaagcag ttttaggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 900 
tgggaaacag aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 960 
acgtatgagc tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 1020 
caaagtgtat gggacgtctc gagatcaagc accggcaacc tcgtggagtg gtatctttta 1080 
agggtggcat acgcgaggaa tgaacttgca ccgaacaaac ctgatgagga agagtataaa 1140 
cggcgcttaa gaacaactta cctgggagga tatgtaaaag agccagaaaa aggtttgtgg 1200 
gaaaatatca tttatttgNN Nttccgcagt ctgtaccctt caataatagt tactcacaac 1260 
gtatccccag atacccttga aaaagagggc tgtaagaatt acgatgttgc tccgatagta 1320 
ggatataggt tctgcaagga ctttccgggc tttattccct ccatactcgg ggacttaatt 1380 
gcaatgaggc aagatataaa gaagaaaatg aaatccacaa ttgacccgat cgaaaagaaa 1440 
atgctcgatt ataggcaaag ggctattaaa ttgcttgcaa acagctatta cggctatatg 1500 
gggtatccta aggcaagatg gtactcgaag gaatgtgctg aaagcgttac cgcatggggg 1560 
agacactaca tagagatgac gataagagaa atagaggaaa agttcggctt taaggttctt 1620 
tatgcggaca ctgacggctt ttatgccaca atacccgggg aaaagcctga actcattaaa 1680 
aagaaagcca aggaattcct aaactacata aactccaaac ttccaggtct gcttgagctt 1740 
gagtatgagg gcttttactt gagaggattc tttgttacaa aaaagcgcta tgcagtcata 1800 
gatgaagagg gcaggataac aacaaggggc ttggaagtag taaggagaga ttggagtgag 1860 
atagctaagg agactcaggc aaaggtttta gaggctatac ttaaagaggg aagtgttgaa 1920 
aaagctgtag aagttgttag agatgttgta gagaaaatag caaaatacag ggttccactt 1980 
gaaaagcttg ttatccatga gcagattacc agggatttaa aggactacaa agccattggc 2040 
cctcatgtcg cgatagcaaa aagacttgcc gcaagaggga taaaagtgaa accgggcaca 2100 
ataataagct atatcgttct caaagggagc ggaaagataa gcgatagggt aattttactt 2160 
acagaatacg atcctagaaa acacaagtac gatccggact actacataga aaaccaagtt 2220 
ttgccggcag tacttaggat actcgaagcg tttggataca gaaaggagga tttaaggtat 2280 
caaagctcaa aacaaaccgg cttagatgca tggctcaaga ggtag 2325 [SEQ ID NO. 109] 
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Vent T544I> NNN= CCT^ CCA^ CCG, CCC (All possible P codons) 

atgatactgg acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
aaagagaacg gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 
cttctcaaag atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
aaaactgtga gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 24 0 
gaagtctgga agctcatttt cgagcatccc caagac^ttc cagctatgcg gggcaaaata 300 
agggaacatc cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
ctcatagaca agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 420 
gatattgaaa cgttttatca tgagggagat gaatttggaa agggcgagat aataatgatt 480 
agttatgccg atgaagaaga ggccagagta atcacatgga aaaatatcga tttgccgtat 54 0 
gtcgatgttg tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 600 
aaagaccccg atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 660 
aaacgggcag aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
cccaagattc agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 780 
gatcttttcc cagttgtgcg aaggacgata aacctcccaa cgtatacgct. tgaggcagtt 84 0 
tatgaagcag ttttaggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 900 
tgggaaacag aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 960 
acgtatgagc tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 1020 
caaagtgtat gggacgtctc gagatcaagc accggcaacc tcgtggagtg gtatctttta 1080 
agggtggcat acgcgaggaa tgaacttgca ccgaacaaac ctgatgagga agagtataaa 1140 
cggcgcttaa gaacaactta cctgggagga tatgtaaaag agccagaaaa aggtttgtgg 1200 
gaaaatatca tttatttgga tttccgcagt ctgtaccctt. caataatagt tactcacaac 1260 
gtatccccag atiacccttga aaaagagggc tgtaagaatt acgatgttgc tccgatagta 1320 
ggatataggt tctgcaagga ctttccgggc tttattccct ccatactcgg ggacttaatt 1380 
gcaatgaggc aagatataaa gaagaaaatg aaatccacaa ttgacccgat cgaaaagaaa 1440 
atgctcgatt ataggcaaag ggctattaaa ttgcttgcaa acagctatta cggctatatg 1500 
gggtatccta aggcaagatg gtactcgaag gaatgtgctg aaagcgttac cgcatggggg 1560 
agacactaca tagagatgac gataagagaa atagaggaaa agttcggctt taaggttctt 1620 
tatgcggacN NNgacggctt ttatgccaca atacccgggg aaaagcctga actcattaaa 1680 
aagaaagcca aggaattcct aaactacata aactccaaac ttccaggtct gcttgagctt 1740 
gagtatgagg gcttttactt gagaggattc tttgttacaa aaaagcgcta tgcagtcata 1800 
gatgaagagg gcaggataac aacaaggggc ttggaagtag taaggagaga ttggagtgag 1860 
atagctaagg agactcaggc aaaggtttta gaggctatac ttaaagaggg aagtgttgaa 1920 
aaagctgtag aagttgttag agatgttgta gagaaaatag caaaatacag ggttccactt 1980 
gaaaagcttg ttatccatga gcagattacc agggatttaa aggactacaa agccattggc 2040 
cctcatgtcg cgatagcaaa aagacttgcc gcaagaggga taaaagtgaa accgggcaca 2100 
ataataagct atatcgttct caaagggagc ggaaagataa gcgatagggt aattttactt 2160 
acagaatacg atcctagaaa acacaagtac gatccggact actacataga aaaccaagtt 2220 
ttgccggcag tacttaggat actcgaagcg tttggataca gaaaggagga tttaaggtat 2280 
caaagctcaa aacaaaccgg cttagatgca tggctcaaga ggtag 2325 [SEQ ID NO. 110] 

Vent D545G NNN=GGT, GGC^ GGA, GGG (All possible 6 codons) 
atgatactgg acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
aaagagaacg gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 
cttctcaaag atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
aaaactgtga gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 240 
gaagtctgga agctcatttt cgagcatccc caagacgttc cagctatgcg gggcaaaata 300 
agggaacatc cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
ctcatagaca agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 420 
gatattgaaa cgttttatca tgagggagat gaatttggaa agggcgagat aataatgatt 480 
agttatgccg atgaagaaga ggccagagta atcacatgga aaaatatcga tttgccgtat 540 
gtcgatgttg tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 600 
aaagaccccg atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 660 
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aaacgggcag aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
cccaagattc agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 780 
gatcttttcc cagttgtgcg aaggacgata aacctcccaa cgtatacgct tgaggcagtt 84 0 
tatgaagcag ttttaggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 900 
tgggaaacag aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 960 
acgtatgagc tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 1020 
caaagtgtat gggacgtctc gagatcaagc accggcaacc tcgtggagtg gtatctttta 1080 
agggtggcat acgcgaggaa tgaacttgca ccgaacaaac ctgatgagga agagtataaa 1140 
cggcgcttaa gaacaactta cctgggagga tatgtaaaag agccagaaaa aggtttgtgg 1200 
gaaaatatca tttatttgga tttccgcagt ctgtaccctt caataatagt tactcacaac 1260 
gtatccccag atacccttga aaaagagggc tgtaagaatt acgatgttgc tccgatagta 1320 
ggatataggt tctgcaagga ctttccgggc tttattccct ccatactcgg ggacttaatt 1380 
gcaatgaggc aagatataaa gaagaaaatg aaatccacaa ttgacccgat cgaaaagaaa 14 40 
atgctcgatt ataggcaaag ggctattaaa ttgcttgcaa acagctatta cggctatatg 1500 
gggtatccta aggcaagatg gtactcgaag gaat'gtgctg aaagcgttac cgcatggggg 1560 
agacactaca tagagatgac gataagagaa a1:agaggaaa agttcggctt taaggttctt 1620 
tatgcggaca ctllNNggctt ttatgccaca atacccgggg aaaagcctga actcattaaa 1680 
aagaaagcca aggaattcct aaactacata aactccaaac ttccaggtct gcttgagctt 1740 
gagtatgagg gcttttactt gagaggattc tttgttacaa aaaagcgcta tgcagtcata 1800 
gatgaagagg gcaggataac aacaaggggc ttggaagtag taaggagaga ttggagtgag 1860 
atagctaagg agactcaggc aaaggtttta gaggctatac ttaaagaggg aagtgttgaa 1920 
aaagctgtag aagttgttag agatgttgta gagaaaatag caaaatacag ggttccactt 1980 
gaaaagcttg ttatccatga gcagattacc agggatttaa aggactacaa agccattggc 2040 
cctcatgtcg cgatagcaaa aagacttgcc gcaagaggga taaaagtgaa accgggcaca 2100 
ataataagct atatcgttct caaagggagc ggaaagataa gcgatagggt aattttactt 2160 
acagaatacg atcctagaaa acacaagtac gatccggact actacataga aaaccaagtt 2220 
ttgccggcag tacttaggat actcgaagcg tttggataca gaaaggagga tttaaggtat 2280 
caaagctcaa aacaaaccgg cttagatgca tggctcaaga ggtag 2325 [SEQ ID NO. Ill] 

Vent K595T NNN=ACT, ACC, ACA, ACG (All possiJble T codons) 
atgatactgg acactgatta cataacaaaa gatggcaagc ctataatccg aatttttaag 60 
aaagagaacg gggagtttaa aatagaactt gaccctcatt ttcagcccta tatatatgct 120 
cttctcaaag atgactccgc tattgaggag ataaaggcaa taaagggcga gagacatgga 180 
aaaactgtga gagtgctcga tgcagtgaaa gtcaggaaaa aatttttggg aagggaagtt 240 
gaagtctgga agctcatttt cgagcatccc caagacgttc cagctatgcg gggcaaaata 300 
agggaacatc cagctgtggt tgacatttac gaatatgaca taccctttgc caagcgttat 360 
ctcatagaca agggcttgat tcccatggag ggagacgagg agcttaagct ccttgccttt 420 
gatattgaaa cgttttatca tgagggagat gaatttggaa agggcgagat aataatgatt 480 
agttatgccg atgaagaaga ggccagagta atcacatgga aaaatatcga tttgccgtat 540 
gtcgatgttg tgtccaatga aagagaaatg ataaagcgtt ttgttcaagt tgttaaagaa 600 
aaagaccccg atgtgataat aacttacaat ggggacaatt ttgatttgcc gtatctcata 660 
aaacgggcag aaaagctggg agttcggctt gtcttaggaa gggacaaaga acatcccgaa 720 
cccaagattc agaggatggg tgatagtttt gctgtggaaa tcaagggtag aatccacttt 780 
gatcttttcc cagttgtgcg aaggacgata aacctcccaa cgtatacgct tgaggcagtt 840 
tatgaagcag ttttaggaaa aaccaaaagc aaattaggag cagaggaaat tgccgctata 900 
tgggaaacag aagaaagcat gaaaaaacta gcccagtact caatggaaga tgctagggca 960 
acgtatgagc tcgggaagga attcttcccc atggaagctg agctggcaaa gctgataggt 1020 
caaagtgtat gggacgtctc gagatcaagc accggcaacc tcgtggagtg gtatctttta 1080 
agggtggcat acgcgaggaa tgaacttgca ccgaacaaac ctgatgagga agagtataaa 1140 
cggcgcttaa gaacaactta cctgggagga tatgtaaaag agccagaaaa aggtttgtgg 1200 
gaaaatatca tttatttgga tttccgcagt ctgtaccctt caataatagt tactcacaac 1260 
gtatccccag atacccttga aaaagagggc tgtaagaatt acgatgttgc tccgatagta 1320 
ggatataggt tctgcaagga ctttccgggc tttattccct ccatactcgg ggacttaatt 1380 
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gcaatgaggc aagatataaa gaagaaaatg aaatccacaa ttgacccgat cgaaaagaaa 1440 
atgctcgatt ^taggcaaag ggctattaaa ttgcttgcaa acagctatta cggctatatg 1500 
gggtatccta aggcaagatg gtactcgaag gaatgtgctg aaagcgttac cgcatggggg 1560 
agacactaca tagagatgac gataagagaa atagaggaaa agttcggctt taaggttctt 1620 
tatgcggaca ctgacggctt ttatgccaca atacccgggg aaaagcctga actcattaaa 1680 
aagaaagcca aggaattcct aaactacata aactccaaac ttccaggtct gcttgagctt 174 0 
gagtatgagg gcttttactt gagaggattc tttgttacaa aaNNNcgcta tgcagtcata 1800 
gatgaagagg gcaggataac aacaaggggc ttggaagtag taaggagaga ttggagtgag 1860 
atagctaagg agactcaggc aaaggtttta gaggctatac ttaaagaggg aagtgttgaa 1920 
aaagctgtag aagttgttag agatgttgta gagaaaatag caaaatacag ggttccactt 1980 
gaaaagcttg ttatccatga, gcagattacc agggatttaa aggactacaa agccattggc 2040 
cctcatgtcg cgatagcaaa aagacttgcc gcaagaggga taaaagtgaa accgggcaca 2100 
ataataagct atatcgttct caaagggagc ggaaagataa gcgatagggt aattttactt 2160 
acagaatacg atcctagaaa acacaagtac gatccggact actacataga aaaccaagtt 2220 
ttgccggcag tacttaggat actcgaagcg tttggataca gaaaggagga tttaaggtat 2280 
caaagctcaa aacaaaccgg cttagatgca tggctcaaga ggtag 2325 [SEQ ID NO. 112] 
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Deep Vent 

mildadyitedgkpiirifkkengef kveydrnfrpyiyallkddsqidevrkitaerhgkivriidaekvrkkflg 

rpievwriyfehpqdvpairdkirehsavidifeydipfakrylidkglipmegdeelkllafdietlyhegeefak 

gpiimisyadeeeakvitwkkidlpyvevvsseremikrflkvirekdpdviityngdsfdlpylvkraeklgiklp 

Igrdgsepkmqrlgdintaveikgrihfdlyhvirrtinlptytleavyeaifgkpkekvyaheiaeawetgkgierv 

akysmedakvtyelgreffpiaeaqlsrlvgqplwdvsrsstgnlvewyllrkayernelapnkpdereyerrlresy 

aggyvkepekgiweglvsldfrslypsiiithnvspdtlnregcreydvapevghkfckdfpgflpsllkrllderq 

eikrkrakaskdpiekkmldyrqraikilansyygyygyakarwyckecaesvtawgreyiefvrkeleekfgfkvly 

idtdglyatipgakpeeikkkalefvdyinaklpglleleyegfyvrgffvtkkkyalideegkiitrgleivrrdw 

seiaketqakvleailkhgnveeavkivkevteklskyeippeklviyeqitrplheykaigphvavakrlaargvk 

vrpgmvigyivlrgdgpiskrailaeefdlrkhkydaeyyienqvlpavlrileafgyrkedlrwaktkotaltawl 
nikkk [SEQ ID NO. 113] 



AGAGAGCATT CCGCAGTTAT 
CTAATAGACA AAGGCCTAAT 



Deep Vent YBBSN NNN= AAT, AAC (AIX possible N codons) 
Deep Vent YBBSI* 1SINN= TTA^ TTG, CTT , CTC, CTA, CTG (All 
Deep Vent Y385H NNN= CAT, CAC (AIX possible H codons) 
Deep Vent Y385Q NNN= CAA, CAG (All possible Q codons) 
Deep Vent y385S NNN= TCT, TCC, TCA, TCG^ AGT, AGC (All 
ATGATACTTG ACGCTGACTA CATCACCGAG GATGGGAAGC CGATTATAAG 
AAAGAAAACG GCGAGTTTAA GGTTGAGTAC GACAGAAACT TTAGACCTTA 
CTCCTCAAAG ATGACTCGCA GATTGATGAG GTTAGGAAGA TAACCGCCGA 
AAGATAGTGA GAATTATAGA TGCCGAAAAG GTAAGGAAGA AGTTCCTGGG 
GAGGTATGGA GGCTGTACTT TGAACACCCT CAGGACGTTC CCGCAATAAG 

TGACATCTTT GAGTACGACA TTCCGTTCGC 
TCCAATGGAA GGCGATGAAG AGCTCAAGTT 
GACATAGAAA CCCTCTATCA CGAAGGGGAG GAGTTCGCGA AGGGGCCCAT 
AGCTATGCTG ATGAGGAAGA AGCCAAAGTC ATAACGTGGA AAAAGATCGA 
GTCGAGGTAG TTTCCAGCGA GAGGGAGATG ATAAAGCGGT TCCTCAAGGT 
AAAGATCCCG ATGTTATAAT TACCTACAAC GGCGATTCTT TCGACCTTCC 

aagagggccg aaaagctcgg gataaagcta cccctgggaa gggacggtag 
atgcagaggc ttggggatat gacagcggtg gagataaagg gaaggataca 
tacc:acgtga ttaggagaac gataaacctc ccaacataca ccctcgaggc 
gcaatcttcg gaaagccaaa ggagaaagtt tacgctcacg agatagctga 
actggaaagg gactggagag agttgcaaag tattcaatgg aggatgcaaa 
gagctcggta gggagttctt cccaatggag gcccagcttt caaggttagt 

CTGTGGGATG TTTCTAGGTC TTCAACTGGC AACTTGGTGG AGT(3GTACCT 
GCCTACGAGA GGAATGAATT G(3CTCCAAAC AAGCCGGATG AGAGGGAGTA 
CTAAGGGAGA G OaNNG CTGG GGGATACGTT AAGGAGCCGG AGAAAGGGCT 
TTAGTTTCCC TAGATTTCAG GAGCCTGTAC CCCTCGATAA TAATCACCCA 
CCGGATACGC TGAACAGGGA AGGGTGTAGG GAATACGATG TCGCCCCAGA 
T^GTTCTGCA AGGACTTCCC GGGGTTTATC CCCAGCCTGC TCAAGAGGTT 
AGGCAAGAAA TAAAAAGGAA GATGAAAGCT TCTAAA(3ACC CAATCGAGAA 
GATTAChGGC AACGGGCAAT CAAAATCCTG GCAAACAGCT ATTATGGGTA 
GCAAAAGCCC GTTGGTACTG TAAGGAGTGC GCAGAGAGCG TTACGGCCTG 
TATATAGAGT TCGTAAGGAA GGAACTGGAG GAAAAGTTCG GGTTCAAAGT 
GACACAGATG GACTCTACGC CACAATTCCT GGGGCLW^UVC CCGAGGA(3AT 
GCCCTAGAGT TCGTAGATTA TATAAACGCC AAGCTCCCAG GGCTGTTGGA 
GAGGGCTTCT ACGTGAGAGG GTTCTTCGTG ACG7UVGAAGA AGTATGCGTT 
GAAGGGAAGA* TAATCACTAG GGGGCTTGAA ATAGTCAGGA GGGACTGGAG 
AAAGAAACCC AAGCAAAAGT CCTAGAGGCT ATCCTAAAGC ATGGCAACGT 
GTAAAGATAG TTAAGGAGGT 7VACTGAAAAG CTGACJCAAGT ACGAAATACC 
CTAGTTATTT ACGA(;CAGAT CACGAGGCCC CTTCACGAGT ACAAGGCTAT 
GTTGCCGTGG CAAAAAGGTT AGCCGCTAGA GGAGTAAAGG TGAGGCCTGG 



possible Ja codons) 



possi±»le S 

GATTTTCAAG 

CATTTACGCT 

GAGGCATGGG 

GAGGCCGATT 

GGATAAGATA 

GAAGAGGTAC 

GCTCGCATTT 

TATAATGATA 

TCTCCCGTAC 

GATAAGGGAG 

CTATCTAGTT 

TGAGCCAAAG 

CTTTGACCTC 

AGTTTATGAG 

GGCCTGGGAG 

GGTAACGTAC 

CGGCCAGCCC 

CCTCAGGAAG 

CGAGAGAAGG 

CTGGGAGGGG 

TAACGTCTCA 

GGTTGGGCAC 

ATTGGATGAA 

GAAGATGCTT 

TTATGGGTAC 

GGGGAGGCSAA 

CTTATACATA 

AAAGAAGAAA 

GGTTGAGTAC 

GATAGATC3AG 

CGAAATAGCC 

TGAGGAGGCA 

TCCAGAAAAG 

AGGTCCGCaiC 

CATGGTGATA 



codons) 
60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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GGGTACATAG TGCTGAGGGG AGACGGGCCA ATAAGCAAGA GGGCTATCCT TGCAGAGGAG 2160 

TTCGATCTCA GGAAGCATAA GTATGACGCT GAGTATTACA TAGAAAATCA GGTTTTACCT 2220 

. GCCGTTCTTA GAATATTAGA GGCCTTTGGG TACAGGAAAG AAGACCTCAG GTGGCAGAAG 2280 

ACTAAACAGA CAGGTCTTAC GGCATGGCTT AACATCAAGA AGAAGTAA 2328 
[SEQ ID NO. 114] 

Deep Vent G387S NNN= TCT, TCC, TCA, TCG, AGO?, AGC (All possible S codons) 
Deep Vent G387P NNN= CCT, CCA, CCG, CCC (All possi±>le P codons) 

ATGATACTTG ACGCTGACTA CATCACCGAG GATGGGAAGC CGATTATAAG GATTTTCAAG 60 

AAAGAAAACG GCGAGTTTAA GGTTGAGTAC GACAGAAACT TTAGACCTTA CATTTACGCT 120 

CTCCTCAAAG ATGACTCGCA GATTGATGAG GTTAGGAAGA TAACCGCCGA GAGGCATGGG 180 

AAGATAGTGA GAATTATAGA TGCCGAAAAG GTAAGGAAGA AGTTCCTGGG GAGGCCGATT 240 

GAGGTATGGA GGCTGTACTT TGAACACCCT CAGGACGTTC CCGCAATAAG GGATAAGATA 300 

AGAGAGCATT CCGCAGTTAT TGACATCTTT GAGTACGACA TTCCGTTCGC GAAGAGGTAC 360 

CTAATAGACA AAGGCCTAAT TCCAATGGAA GGCGATGAAG AGCTCAAGTT GCTCGCATTT 420 

GACATAGAAA CCCTCTATCA CGAAGGGGAG GAGTTCGCGA AGGGGCCCAT TATAATGATA 480 

AGCTATGCTG ATGAGGAAGA AGCCAAAGTC ATAACGTGGA AAAAGATCGA TCTCCCGTAC 54 0 

GTCGAGGTAG TTTCCAGCGA GAGGGAGATG ATAAAGCGGT TCCTCAAGGT GATAAGGGAG 600 

AAAGATCCCG ATGTTATAAT TACCTACAAC GGCGATTCTT TCGACCTTCC CTATCTAGTT 660 

AAGAGGGCCG AAAAGCTCGG GATAAAGCTA CCCCTGGGAA GGGACGGTAG TGAGCCAAAG 720 

ATGGAGAGGC TTGGGGATAT GACAGCGGTG GAGATAAAGG GAAGGATACA CTTTGACCTC 780 

TACCACGTGA TTAGGAGAAC GATAAACCTC CCAACATACA CCCTCGAGGC AGTTTATGAG 840 

GCAATCTTCG GAAAGCCAAA GGAGAAAGTT TACGCTCACG AGATAGCTGA GGCCTGGGAG 900 

ACTGGAAAGG GACTGGAGAG AGTTGCAAAG TATTCAATGG AGGATGC7VAA GGTAACGTAC 960 

GAGCTCG6TA GGGAGTTCTT CCCAATGGAG GCCCAGCTTT CAAGGTTAGT CGGCCAGCCC 1020 

CTGTGGGATG. TTTCTAGGTC TTCAACTGGC AACTTGGTGG AGTGGTACCT CCTCAGGAAG 1080 

GCCTACGAGA GGAATGAATT GGCTCCAAAC AAGCCGGATG AGAGGGAGTA CGAGAGAAGG 1140 

CTAAGGGAGA GCTACGCl^ NGGATACGTT AAGGAGCCGG AGAAAGGGCT CTGGGAGGGG 1200 

TTAGTTTCCC TAGATTTCAG GAGCCTGTAC CCCTCGATAA TAATCACCCA TAACGTCTCA 1260 

CCGGATACGC TGAACAGGGA AGGGTGTAGG GAATACGATG TCGCCCCAGA GGTTGGGCAC 1320 

AAGTTCTGCA- AGGACTTCCC GGGGTTTATC CCCAGCCTGC TCAAGAGGTT ATTGGATGAA 1380 

AGGCAAGAAA TAAAAAGGAA GATGAAAGCT TCTAAAGACC CAATCGAGAA GAAGATGCTT 144 0 

GATTACAGGC AACGGGCAAT CAAAATCCTG GCAAACAGCT ATTATGGGTA TTATGGGTAC 1500 

GCAAAAGCCC GTTGGTACTG TAAGGAGTGC GCAGAGAGCG TTACGGCCTG GGGGAGGGAA 1560 

TATATAGAGT TCGTAAGGAA GGAACTGGAG GAAAAGTTCG GGTTCAAAGT CTTATACATA 1620 

GACACAGATG GACTCTACGC CACAATTCCT GGGGCAAAAC CCGAGGAGAT AAAGAAGAAA 1680 

GCCCTAGAGT TCGTAGATTA TATAAACGCC AAGCTCCCAG GGCTGTTGGA GGTTGAGTAC 1740 

GAGGGCTTCT ACGTGAGAGG GTTCTTCGTG ACGAAGAAGA AGTATGCGTT GATAGATGAG 1800 

GAAGGGAAGA TAATCACTAG GGGGCTTGAA ATAGTCAGGA GGGACTGGAG CGAAATAGCC 1860 

AAAGTU^CCC AAGCAAAAGT CCTAGAGGCT ATCCTAAAGC ATGGCAACGT TGAGGAGGCA 1920 

GTAAAGATAG TTAAGGAGGT AACTGAAAAG CTGAGCAAGT ACGAAATACC TCCAGAATU^G 1980 

CTAGTTATTT ACGAGCAGAT CACGAGGCCC CTTCACGAGT ACAAGGCTAT AGGTCCGCAC 2040 

GTTGCCGTGG CAAAAAGGTT - AGCCGCTAGA GGAGTAAAGG TGAGGCCTGG CATGGTGATA 2100 

GGGTACATAG TGCTGAGGGG AGACGGGCCA ATAAGCAAGA GGGCTATCCT TGCAGAGGAG 2160 

TTCGATCTCA GGAAGCATAA GTATGACGCT GAGTATTACA TAGAAAATCA GGTTTTACCT 2220 

GCCGTTCTTA GAATATTAGA GGCCTTTGGG TACAGGAAAG AAGACCTCAG GTGGCAGAAG 2280 

ACTAAACAGA CAGGTCTTAC GGCATGGCTT AACATCAAGA AGAAGTAA " 2328 
[SEQ ID NO.. 115] 

Deep Vent G388A NNN= GGA, GCT, GCC, GCG (All possible A codons) 
Deep Vent G388P NNN= CCT, CCA, CCG, CCC (All possible P codons) 

ATGATACTTG ACGCTGACTA CATCACCGAG GATGGGAAGC CGATTATAAG GATTTTCAAG 60 

AAAGAAAACG GCGAGTTTAA GGTTGAGTAC GACAGAAACT TTAGACCTTA CATTTACGCT 120 



/ 
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CTCCTCAAAG ATGACTCGCA GATTGATGAG GTTAGGAAGA TAACCGCCGA GAGGCATGGG 180 

AAGATAGTGA GAATTATAGA TGCCGAAAAG GTAAGGAAGA AGTTCCTGGG GAGGCCGATT 240 

GAGGTATGGA GGCTGTACTT TGAACACCCT CAGGACGTTC CCGCAATAAG GGATAAGATA 300 

AGAGAGCATT CCGCAGTTAT TGACATCTTT GAGTACGACA TTCCGTTCGC GAAGAGGTAC 360 

CTAATAGACA AAGGCCTAAT TCCAATGGAA GGCGATGAAG AGCTCAAGTT GCTCGCATTT 420 

GACATAGAAA CCCTCTATCA CGAAGGGGAG GAGTTCGCGA AGGGGCCCAT TATAATGATA 4 80 

AGCTATGCTG ATGAGGAAGA AGCCAAAGTC ATAACGTGGA AAAAGATCGA TCTCCCGTAC 54 0 

GTCGAGGTAG TTTCCAGCGA GAGGGAGATG ATA7VAGCGGT TCCTCAAGGT GATAAGGGAG 600 

AAAGATCCCG ATGTTATAAT TACCTACAAC GGCGATTCTT TCGACCTTCC CTATCTAGTT 660 

AAGAGGGCCG AAAAGCTCGG GATAAAGCTA CCCCTGGGAA GGGACGGTAG TGAGCCAAAG 720 

ATGCAGAGGC TTGGGGATAT GACAGCGGTG GAGATAAAGG GAAGGATACA CTTTGACCTC 780 

TACCACGTGA TTAGGAGAAC GATAAACCTC CCAACATACA CCCTCGAGGC AGTTTATGAG 840 

GCAATCTTCG G7\AAGCCAAA GGAGAAAGTT TACGCTCACG AGATAGCTGA GGCCTGGGAG 900 

ACTGGAAAGG GACTGGAGAG AGTTGCAAAG TATTCAATGG AGGATGCAAA GGTAACGTAC 960 

GAGCTCGGTA GGGAGTTCTT CCCAATGGAG GCCCAGCTTT CAAGGTTAGT CGGCCAGCCC 1020 

CTGTGGGATG TTTCTAGGTC TTCAACTGGC AACTTGGTGG AGTGGTACCT CCTCAGGAAG 1080 

GCCTACGAGA GGAATGAATT GGCTCCAAAC AAGCCGGATG AGAGGGAGTA CGAGAGAAGG 1140 

CTAAGGGAGA GCTACGCTGG GNNN TACGTT AAGGAGCCGG AGAAAGGGCT CTGGC^AGGGG 1200 

TTAGTTTCCC TAGATTTCAG GAGCCTGTAC CCCTCGATAA TAATCACCCA TAACGTCTCA 1260 

CCGGATACGC TGAACAGGGA AGGGTGTAGG GAATACGATG TCGCCCCAGA GGTTGGGCAC 1320 

AAGTTCTGCA AGGACTTCCC GGGGTTTATC CCCAGCCTGC TCAAGAGGTT ATTGGATGAA 1380 

AGGCAAGAAA TAAAAAGGAA GATGAAAGCT TCTAAAGACC CAATCGAGAA GAAGATGCTT 1440 

GATTACAGGC AACGGGCAAT CAAAATCCTG GCAAACAGCT ATTATGGGTA TTATGGGTAC 1500 

GCAAAAGCCC GTTGGTACTG TAAGGAGTGC GCAGAGAGCG TTACGGCCTG GGGGAGGGAA 1560 

TATATAGAGT TCGTAAGGAA GGAACTGGAG GAAAAGTTCG GGTTCAAAGT CTTATACATA 1620 

GACACAGATG GACTCTACGC CACAATTCCT GGGGCAAAAC CCGAGGAGAT AAAGAAGAAA 1680 

GCCCTAGAGT TCGTAGATTA TATAAACGCC AAGCTCCCAG GGCTGTTGGA GCTTGAGTAC 1740 

GAGGGCTTCT ACGTGAGAGG GTTCTTCGTG ACGAAGAAGA AGTATGCGTT GATAGATGAG 1800 

GAAGGGAAGA TAATCACTAG GGGGCTTGAA ATAGTCAGGA GGGACTGGAG CGAi^ATAGCC 1860 

AAAGAAACCC AAGCAAAAGT CCTAGAGGCT ATCCTAAAGC ATGGCAACGT TGAGGAGGCA 1920 

GTAAAGATAG TTAAGGAGGT AACTGAAAAG CTGAGCAAGT ACGAAATACC TCCAGAAAAG 1980 

CTAGTTATTT ACGAGCAGAT CACGAGGCCC CTTCACGAGT ACAAGGCTAT AGGTCCGCAC 2040 

GTTGCCGTGG CAAAAAGGTT AGCCGCTAGA GGAGTAAAGG TGAGGCCTGG CATGGTGATA 2100 

GGGTACATAG TGCTGAGGGG AGACGGGCCA ATAAGCTiAGA GGGCTATCCT TGCAGAGGAG 2160 

TTCGATCTCA GGAAGCAT7VA GTATGACGCT GAGTATTACA TAGAAAATCA GGTTTTACCT 2220 

GCCGTTCTTA GAATATTAGA GGCCTTTGGG TACAGGAAAG AAGACCTCAG GTGGCAGAAG 2280 

ACTAAACAGA CAGGTCTTAC GGCATGGCTT AACATCAAGA AGAAGTT^ 2328 
[SEQ ID NO. 116] 

Deep Vent D405E NNN= GAA, GAG {All possible E codons) 

ATGATACTTG ACGCTGACTA CATCACCGAG GATGGGAAGC CGATTATAAG GATTTTCAAG 60 

AAAGAAAACG GCGAGTTTAA GGTTGAGTAC GACAGAAACT TTAGACC^PTA CATTTACGCT * 120 

CTCCTCAAAG ATGACTCGCA GATTGATGAG GTTAGGAAGA TAACCGCCGA GAGGCATGGG 180 

AAGATAGTGA GAATTATAGA TGCCGAA/^G GTAAGGAAGA AGTTCCTGGG GAGGCCGATT 240 

GAGGTATGGA GGCTGTACTT TGAACACCCT CAGGACGTTC CCGCAATAAG GGATAAGATA 300 

AGAGAGCATT CCGCAGTTAT TGACATCTTT GAGTACGACA TTCCGTTCGC GAAGAGGTAC 360 

CTAATAGACA AAGGCCTAAT TCCAATGGAA GGCGATGAAG AGCTCAAGTT GCTCGCATTT 420 

GACATAGAAA CCCTCTATCA CGAAGGGGAG GAGTTCGCGA AGGGGCCCAT TATAATGATA 480 

AGCTATGCTG ATGAGGAAGA AGCCAAAGTC ATAACGTGGA AAAAGATCGA TCTCCCGTAC 540 

GTCGAGGTAG TTTCCAGCGA GAGGGAGATG ATAAAGCGGT TCCTCAAGGT GATAAGGGAG 600 

AAAGATCCCG ATGTTATAAT TACCTACAAC GGCGATTCTT TCGACCTTCC CTATCTAGTT 660 

AAGAGGGCCG AAAAGCTCGG GATAAAGCTA CCCCTGGGAA GGGACGGTAG TGAGCCAAAG 720 

ATGCAGAGGC TTGGGGATAT GACAGCGGTG GAGATAAAGG GAAGGATACA CTTTGACCTC 780 
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TACCACGTGA 
GCAATCTTCG 
ACTGGAAAGG 
GAGCTCGGTA 
CTGTGGGATG 
GCCTACGAGA 
CTAAGGGAGA 
TTAGTTTCCC 
CCGGATACGC 
AAGTTCTGCA 
AGGCAAGAAA 
GATTACAGGC 
GCAAAAGCCC 
TATATAGAGT 
GACACAGATG 
GCCCTAGAGT 
GAGGGCTTCT 
GAAGGGAAGA 
AAAGAAACCC 
GTAAAGATAG 
CTAGTTATTT 
GTTGCCGTGG 
GGGTACATAG 
TTCGATCTCA 
GCCGTTCTTA 
ACTAAACAGA 
[SEQ ID NO. 



TTAGGAGAAC 
GAAAGCCAAA 
GACTGGAGAG 
GGGAGTTCTT 
TTTCTAGGTC 
GGAATGAATT 
GCTACGCTGG 
TA NNN TTCAG 
TGAACAGGGA 
AGGACTTCCC 
TAAAAAGGAA 
AACGGGCAAT 
GTTGGTACTG 
TCGTAAGGAA 
GACTCTACGC 
TCGTAGATTA 
ACGTGAGAGG 
TAATCACTAG 
AAGCAAAAGT 
TTAAGGAGGT 
ACGAGCAGAT 
CAAAAAGGTT 
TGCTGAGGGG 
GGAAGCATAA 
GAATATTAGA 
CAGGTCTTAC 
117] 



GATATVACCTC 
GGAGAAAGTT 
AGTTGCAAAG 
CCCAATGGAG 
TTCAACTGGC 
GGCTCCAAAC 
GGGATACGTT 
GAGCCTGTAC 
AGGGTGTAGG 
GGGGTTTATC 
GATGAAAGCT 
CAAAATCCTG 
TAAGGAGTGC 
GGAACTGGAG 
CACAATTCCT 
TATAAACGCC 
GTTCTTCGTG 
GGGGCTTGAA 
CCTAGAGGCT 
AACTGAAAAG 
CACGAGGCCC 
AGCCGCTAGA 
AGACGGGCCA 
GTATGACGCT 
GGCCTTTGGG 
GGCATGGCTT 



CgAACATACA 
TACGCTCACG 
TATTCAATGG 
GCCCAGCTTT 
AACTTGGTGG 
AAGCCGGATG 
AAGGAGCCGG 
CCCTCGATAA 
GAATACGATG 
CCCAGCCTGC 
TCTAAAGACC 
GCA7\ACAGCT 
GCAGAGAGCG 
GAAAAGTTCG 
GGGGCAAAAC 
AAGCTCCCAG 
ACGAAGAAGA 
ATAGTCAGGA 
ATCCTAAAGC 
CTGAGCAAGT 
CTTCACGAGT 
GGAGTAAAGG 
ATAAGCAAGA 
GAGTATTACA 
TACAGGAAAG 
AACATCAAGA 



CCCTCGAGGC 
AGATAGCTGA 
AGGATGCAAA 
CAAGGTTAGT 
AGTGGTACCT 
AGAGGGAGTA 
AGAAAGGGCT 
TAATCACCCA 
TCGCCCCAGA 
TCAAGAGGTT 
CAATCGAGAA 
ATTATGGGTA 
TTACGGCCTG 
GGTTCAAAGT 
CCGAGGAGAT 
GGCTGTTGGA 
AGTATGCGTT 
. GGGACTGGAG 
ATGGCAACGT 
ACGAAATACC 
ACAAGGCTAT 
TGAGGCCTGG 
GGGCTATCCT 
TAGAAAATCA 
AAGACCTCAG 
AGAAGTAA 



AGTTTATGAG 

GGCCTGGGAG 

GGTAACGTAC 

CGGCCAGCCC 

CCTCAGGAAG 

CGAGAGAAGG 

CTGGGAGGGG 

TAACGTCTCA 

GGTTGGGCAC 

ATTGGATGAA 

GAAGATGCTT 

TTATGGGTAC 

GGGGAGGGAA 

CTTATACATA 

AAAGAAGAAA 

GCTTGAGTAC 

GATAGATGAG 

CGAAATAGCC 

TGAGGAGGCA 

TCCAGAAAAG 

AGGTCCGCAC 

CATGGTGATA 

TGCAGAGGAG 

GGTTTTACCT 

GTGGCAGAAG 



Deep Vent 
ATGATACTTG 
AAAGAAAACG 
CTCCTCAAAG 
AAGATAGTGA 
GAGGTATGGA 
AGAGAGCATT 
CTAATAGACA 
GACATAGAAA 
AGCTATGCTG 
GTCGAGGTAG 
AAAGATCCCG 
AAGAGGGCCG 
ATGCAGAGGC 
TACCACGTGA 
GCAATCTTCG 
ACTGGAAAGG 
GAGCTCGGTA 
CTGTGGGATG 
GCCTACGAGA 
CTAAGGGAGA 
TTAGTTTCCC 
CCGGATACGC 
AAGTTCTGCA 
AGGC7VAGAAA 



T542P NNN- CCT, CCA, CCG, CCC (All possible P codoixs) 
ACGCTGACTA CATCACCGAG GATGGGAAGC CGATTATAAG GATTTTCAAG 
GCGAGTTTAA GGTTGAGTAC GACAGAAACT TTAGACCTTA CATTTACGCT 
ATGACTCGCA GATTGATGAG GTTAGGAAGA TAACCGCCGA GAGGCATGGG 
GAATTATAGA TGCCGAAAAG GTAAGGAAGA AGTTCCTGGG GAGGCCGATT 
GGCTGTACTT TGAACACCCT CAGGACGTTC CCGCAATAAG GGATAAGATA 
CCGCAGTTAT TGACATCTTT GAGTACGACA TTCCGTTCGC GAAGAGGTAC 
AAGGCCTAAT TCCAATGGAA GGCGATGAAG AGCTGAAGTT GCTCGCATTT 
CCCTCTATCA CGAAGGGGAGT GAGTTCGCGA AGGGGCCCAT TATAATGATA 
ATGAGGAAGA AGCCAAAGTC ATAACGTGGA AAAAGATCGA TCTCCCGTAC 
TTTCCAGCGA GAGGGAGATG ATAAAGCGGT TCCTCAAGGT GATAAGGGAG 
ATGTTATAAT TACCTACAAC GGCGATTCTT TCGACCTTCC CTATCTAGTT 
AAAAGCTCGG GATAAAGCTA CCCCTGGGAA GGGACGGTAG TGAGCCAAAG 
TTGGGGATAT GACAGCGGTG GAGATAAAGG GAAGGATACA CTTTGACCTC 
TTAGGAGAAC GATAAACCTC CCAACATACA CCCTCGAGGC AGTTTATGAG 
GAAAGCCAAA GGAGAAAGTT TACGCTCACG AGATAGCTGA GGCCTGGGAG 
GACTGGAGAG AGTTGCAAAG TATTCAATGG AGGATGCAAA GGTAACGTAC 
GGGAGTTCTT CCCAATGGAG GCCCAGCTTT CAAGGTTAGT CGGCCAGCCC 
TTTCTAGGTC TTCAACTGGC AACTTGGTGG AGTGGTACCT CCTCAGGAAG 
GGAATGAATT GGCTCCAAAC AAGCCGGATG AGAGGGAGTA CGAGAGAAGG 
GCTACGCTGG GGGATACGTT AAGGAGCCGG AGAAAGGGCT CTGGGAGGGG 
TAGATTTCAG GAGCCTGTAC CCCTCGATAA TAATCACCCA TAACGTCTCA 
TGAACAGGGA AGGGTGTAGG GAATACGATG TCGCCCCAGA GGTTGGGCAC 
AGGACTTCCC GGGGTTTATC CCCAGCCTGC TCAAGAGGTT ATTGGATGAA 
TAAAAAGGAA GATGAAAGCT TCTAAAGACC CAATCGAGAA GAAGATGCTT 



840 
900 
960 
1020 
1080. 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2328 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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GATTACAGGC 
GCAAAAGCCC 
TATATAGAGT 
GACNNN6ATG 
GCCCTAGAGT 
GAGGGCTTCT 
GAAGGGAAGA 
AAAGAAACCC 
GTAAAGATAG 
CTAGTTATTT 
GTTGCCGTGG 
GGGTACATAG 
TTCGATCTCA 
GCCGTTCTTA 
ACTAAACAGA 
[SEQ ID NO, 



AACGGGCiWlT 
GTTGGTACTG 
TCGTAAGGAA 
GACTCTACGC 
TCGTAGATTA 
ACGTGAGAGG 
TAATCACTAG 
AAGCAAAAGT 
TTAAGGAGGT 
ACGAGCAGAT 
CAAAAAGGTT 
TGCTGAGGGG 
GGAAGCATAA 
GAATATTAGA 
CAGGTCTTAC 
118] 



CAAAATCCTG 
TAAGGAGTGC 
GGAACTGGAG 
CACAATTCCT 
TATAAACGCC 
GTTCTTCGTG 
GGGGCTTGAA 
CCTAGAGGCT 
AACTGAAAAG 
CACGAGGCCC 
AGCCGCTAGA 
AGACGGGCCA 
GTATGACGCT 
GGCCTTTGGG 
GGCATGGCTT 



GCAAACAGCT 
GCAGAGAGCG 
GAAAAGTTCG 
GGGGCAAAAC 
AAGCTCCCAG 
ACG7VAGAAGA 
ATAGTCAGGA 
ATCCTAAAGC 
CTGAGCAAGT 
CTTCACGAGT 
GGAGTAAAGG 
ATAAGCAAGA 
GAGTATTACA 
TACAGGAAAG 
AACATCAAiSA 



ATTATGGGTA 
TTACGGCCTG 
GGTTCAAAGT 
CCGAGGAGAT 
GGCTGTTGGA 
AGTATGCGTT 
GGGACTGGAG 
ATGGCAACGT 
ACGAAATACC 
ACAAGGCTAT 
TGAGGCCTGG 
GGGCTATCCT 
TAGAAAATCA 
AAGACCTCAG 
AGAAGTAA 



TTATGGGTAC 
GGGGAGGGAA 
CTTATACATA 
AAAGAAGAAA 
GCTTGAGTAC 
GATAGATGAG 
CGAAATAGCC 
TGAGGAGGCA 
TCCAGAAAAG 
AGGTCCGCAC 
CATGGTGATA 
TGCAGAGGAG 
GGTTTTACCT 
GTGGCAGAAG 



Deep Vent D543G NNN=GGT, GGC^ GGA, GGG (All 
ATGATACTTG ACGCTGACTA CATCACCGAG GATGGGAAGC 
AAAGAAAACG GCGAGTTTAA GGTTGAGTAC GACAGAAACT 
CTCCTCAAAG ATGACTCGCA GATTGATGAG GTTAGGAAGA 
AAGATAGTGA GAATTATAGA TGCCGAZVAAG GTAAGGAAGA 
GAGGTATGGA GGCTGTACTT TGAACACCCT CAGGACGTTC 
AGAGAGCATT CCGCAGTTAT TGACATCTTT GAGTACGACA 
CTAATAGACA AAGGCCTAAT TCCAATGGAA GGCGATGAAG 
GACATAGAAA CCCTCTATCA CGAAGGGGAG GAGTTCGCGA 
AGCTATGCTG ATGAGGAAGA AGCCAAAGTC ATAACGTGGA 
GTCGAGGTAG TTTCCAGCGA GAGGGAGATG ATAAAGCGGT 
AAAGATCCCG ATGTTATAAT TACCTACAAC GGCGATTCTT 
AAGAGGGCCG AAAAGCTCGG GATAAAGCTA CCCCTGGGAA 
ATGCAGAGGC TTGGGGATAT GACAGCGGTG GAGATA7VAGG 
TACCACGTGA TTAGGAGAAC GATAAACCTC CCAACATACA 
GCAATCTTCG GAAAGCCAAA GGAGAAAGTT TACGCTCACG 
ACTGGAAAGG GACTGGAGAG AGTTGCAAAG TATTCAATGG 
GAGCTCGGTA GGGAGTTCTT CCCAATGGAG GCCCAGCTTT 
CTGTGGGATG TTTCTAGGTC TTCAACTGGC AACTTGGTGG 
GCCTACGAGA GGAATGAATT GGCTCCAAAC AAGCCGGATG 
CTAAGGGAGA GCTACGCTGG GGGATACGTT AAGGAGCCGG 
TTAGTTTCCC TAGATTTCAG GAGCCTGTAC CCCTCGATAA 
CCGGATACGC TGAACAGGGA AGGGTGTAGG GAATACGATG 
AAGTTCTGCA AGGACTTCCC GGGGTTTATC CCCAGCCTGC 
AGGCAAGAAA TAAAAAGGAA GATGAAAGCT TCTAAAGACC 
GATTACAGGC AACGGGCAAT CATUUITCCTG GCAAACAGCT 
GCAZU\AGCCC GTTGGTACTG TAAGGAGTGC GCAGAGAGCG 
TATATAGAGT TCGTAAGGAA GGAACTGGAG GAAAAGTTCG 
GACAC ANNK G GACTCTACGC CACAATTCCT GGGGCAAAAC 
GCCCTAGAGT TCGTAGATTA TATAAACGCC AAGCTCCCAG 
GAGGGCTTCT ACGTGAGAGG GTTCTTCGTG ACGAAGAAGA 
GAAGGGAAGA TAATCACTAG GGGGCTTGAA ATAGTCAGGA 
AAAGAAACCC AAGCAAAAGT CCTAGAGGCT ATCCTAAAGC 
GTAAAGATAG TTAAGGAGGT AACTGAAAAG CTGAGCAAGT 
CTAGTTATTT ACGAGCAGAT CACGAGGCCC CTTCACGAGT 
GTTGCCGTGG CAAAAAGGTT AGCCGCTAGA GGAGTAAAGG 



possdJdle 6 

CGATTATAAG 

TTAGACCTTA 

TAACCGCCGA 

AGTTCCTGGG 

CCGCAATAAG 

TTCCGTTCGC 

AGCTCAAGTT 

AGGGGCCCAT 

AAAAGATCGA 

TCCTCAAGGT 

TCGACCTTCC 

GGGACGGTAG 

GAAGGATACA 

CCCTCGAGGC 

AGATAGCTGA 

AGGATGCAAA 

CAAGGTTAGT 

AGTGGTACCT 

AGAGGGAGTA 

AGAAAGGGCT 

TAATCACCCA 

TCGCCCCAGA 

TCAAGAGGTT 

CAATCGAGAA 

ATTATGGGTA 

TTACGGCCTG 

GGTTCAAAGT 

CCGAGGAGAT 

GGCTGTTGGA 

AGTATGCGTT 

GGGACTGGAG 

ATGGCAACGT 

ACGAAATACC 

ACAAGGCTAT 

TGAGGCCTGG 



codons) 

GATTTTCAAG 

CATTTACGCT 

GAGGCATGGG 

GAGGCCGATT 

GGATAAGATA 

GAAGAGGTAC 

GCTCGCATTT 

TATAATGATA 

TCTCCCGTAC 

GATAAGGGAG 

CTATCTAGTT 

TGAGCCAAAG 

CTTTGACCTC 

AGTTTATGAG 

GGCCTGGGAG 

GGTAACGTAC 

CGGCCAGCCC 

CCTCAG6AAG 

CGAGAGAAGG 

CTGGGAGGGG 

TAACGTCTCA 

GGTTGGGCAC 

ATTGGATGAA 

GAAGATGCTT 

TTATGGGTAC 

GGGGAGGGAA 

CTTATACATA 

AAAGAAGAAA 

GCTTGAGTAC 

GATAGATGAG 

CGAAATAGCC 

TGAGGAGGCA 

TCCAGAA7\AG 

AGGTCCGCAC 

CATGGTGATA 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2328 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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GGGTACATAG TGCTGAGGGG AGACGGGCCA ATAAGCAAGA GGGCTATCCT TGCAGAGGAG 
TTCGATCTCA GGAAGCATAA GTATGACGCT GAGTATTACA TAGAAAATCA GGTrTTACCT 
GCCGTTCTTA GAATATTAGA GGCCTTTGGG TACAGGAAAG AAGACCTCAG GTGGCAGAAG 
ACTAT^CAGA CAGGTCTTAC GGCATGGCTT AACATCAAGA AGAAGTAA 
[SEQ ID NO. 119] 



Deep Vent K593T NNN=ACT, ACC, ACA, ACG (All 
ATGATACTTG ACGCTGACTA CATCACCGAG GATGGGAAGC 
AAAGAAAACG GCGAGTTTAA GGTTGAGTAC GACAGAAACT 



CTCCTCAAAG ATGACTCGCA GATTGATGAG GTTAGGAAGA 
AAGATAGTGA GAATTATAGA TGCCGAAAAG GTAAGGAAGA 
GAGGTATGGA GGCTGTACTT TGAACACCCT CAGGACGTTC 
AGAGAGCATT CCGCAGTTAT TGACATCTTT GAGTACGACA 
CTAATAGACA AAGGCCTAAT TCCAATGGAA GGCGATGAAG 
GACATAGAAA CCCTCTATCA' CGAAGGGGAG GAGTTCGCGA 
AGCTATGCTG ATGAGGAAGA AGCCAAAGTC ATAACGTGGA 
GTCGAGGTAG TTTCCAGCGA GAGGGAGATG ATAAAGCGGT 
AAAGATCCCG ATGTTATAAT TACCTACAAC GGCGATTCTT 
AAGAGGGCCG AAAAGCTCGG GATAAAGCTA CCCCTGGGAA 
ATGCAGAGGC TTGGGGATAT GACAGCGGTG GAGATAAAGG 
TACCACGTGA TTAGGAGAAC GATAAACCTC CCAACATACA 
GCAATCTTCG GAAAGCCAAA GGAGAAAGTT TACGCTCACG 
ACTGGAAAGG GACTGGAGAG AGTTGCAAAG TATTCAATGG 
GAGCTCGGTA GGGAGTTCTT CCCAATGGAG GCCCAGCTTT 
CTGTGGGATG TTTCTAGGTC TTCAACTGGC AACTTGGTGG 
GCCTACGAGA .GGAATGAATT GGCTCCAAAC AAGCCGGATG 
CTAAGGGAGA GCTACGCTGG GGGATACGTT AAGGAGCCGG 
TTAGTTTCCC TAGATTTCAG GAGCCTGTAC CCCTCGATAA 
CCGGATACGC .TGAACAGGGA AGGGTGTAGG GAATACGATG 
AAGTTCTGCA AGGACTTCCC GGGGTTTATC CCCAGCCTGC 
AGGCAAGAAA ..TAAAAAGGAA GATGAAAGCT TCTAAAGACC 
GATTACAGGC AACGGGCAAT CAAAATCCTG GCAAACAGCT 
GCAAAAGCCC GTTGGTACTG TAAGGAGTGC GCAGAGAGCG 
TATATAGAGT TCGTAAGGAA GGAACTGGAG GAAAAGTTCG 
GACACAGATG GACTCTACGC CACAATTCCT GGGGCAAAAC 
GCCCTAGAGT TCGTAGATTA TATAAACGCC AAGCXCCCAG 
GAGGGCTTCT ACGTGAGAGG GTTCTTCGTG ACGAAGNNKA 
GAAGGGAAGA TAATCACTAG GGGGCTTGAA ATAGTCAGGA 
AAAGAAACCC AAGCAAAAGT CCTAGAGGCT ATCCTAAAGC 
GTAAAGATAG TTAAGGAGGT AACTGAAAAG CTGAGCAAGT 
CTAGTTATTT ACGAGCAGAT CACGAGGCCC CTTCACGAGT 
GTTGCCGTGG CAAAAAGGTT AGCCGCTAGA GGAGTAAAGG 
GGGTACATAG TGCTGAGGGG, AGACGGGCCA ATAAGCAAGA 
TTCGATCTCA GGAAGCATAA GTATGACGCT GAGTATTACA 
GCCGTTCTTA GAATATTAGA GGCCTTTGGG TACAGGAAAG 
ACTAAACAGA CAGGTCTTAC GGCATGGCTT AACATCAAGA 
[SEQ ID NO. 1201 



possi±>le T 
CGATTATAAG 
TTAGACCTTA 
TAACCGCCGA 
AGTTCCTGGG 
CCGCAATAAG 
TTCCGTTCGC 
AGCTCAAGTT 
AGGGGCCCAT 
AAAAGATCGA 
TCCTCAAGGT 
TCGACCTTCC 
GGGACGGTAG 
GAAGGATACA 
CCCTCGAGGC 
AGATAGCTGA 
AGGATGCT^AA 
CAAGGTTAGT 
AGTGGTACCT 
AGAGGGAGTA 
AGAAAGGGCT 
TAATCACCCA 
TCGCCCCAGA 
TCAAGAGGTT 
CAATCGAGAA 
ATTATGGGTA 
TTACGGCCTG 
GGTTCAAAGT 
CCGAGGAGAT 
GGCTGTTGGA 
AGTATGCGTT 
GGGACTGGAG 
ATGGCAACGT 
ACGAAATACC 
ACAAGGCTAT 
TGAGGCCTGG 
GGGCTATCCT 
TAGAAAATCA 
AAGACCTCAG 
AGAAGTAA 



codons) 
GATTTTCAAG 
CATTTACGCT 
GAGGCATGGG 
GAGGCCGATT 
GGATAAGATA 
GAAGAGGTAC 
GCTCGCATTT 
TATAATGATA 
TCTCCCGTAC 
GATAAGGGAG 
CTATCTAGTT 
TGAGCCAAAG 
CTTTGACCTC 
AGTTTATGAG 
GGCCTGGGAG 
GGTAACGTAC 
CGGCCAGCCC 
CCTCAGGAAG 
CGAGAGAAGG 
CTGGGAGGGG 
TAACGTCTCA 
GGTTGGGCAC 
ATTGGATGAA 
GAAGATGCTT 
TTATGGGTAC 
GGGGAGGGAA 
CTTATACATA 
AAAGAAGAAA 
GGTTGAGTAC 
GATAGATGAG 
CGAAATAGCC 
TGAGGAGGCA 
TCCAGAAAAG 
AGGTCCGCAC 
CATGGTGATA 
TGCAGAGGAG 
GGTTTTACCT 
GTGGCAGAAG 



2160 
2220 
2280 
2328 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
' 900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2328 



Tgo Y384N NNN=:AAT^ AAC 

Tgo Y384Ii NNN=a?TA, TT6, CTT, CTC, CTA, CTG 
Tgo Y384H NNN=CAT, CAC 
Tgo Y384Q NNN=CAA, GAG 
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Tgo Y384S NNN=TCT, TCC, TCA, TCG, AGT, AGO 



atgatcctcg 

aaggagaacg 

ctcttgaagg 

actaccgtta 

gaggtctgga 

aaggagcatc 

ctca-tagaca 

gacatcgaga 

agctacgccg 

gtcgacgtcg 

aaggatcccg' 

aagcgctccg 

atccagcgca 

taccc'cgtca 

gccatctttg 

acgggcgagg 

gaactcggaa 

ctctgggatg- 

gcctacgaga 

agggagagcn 

gtgtatctgg 

gatacactca 

ttctgcaagg 

cagaaggtaa 

tacaggcaac 

aaggcccgct 

atcgagacca 

acagatggat 

aaggagttcc 

ggcttctaca 

gacaagataa 

gagacgcagg 

aggat-tgtca 

gtcatctacg 

gctgttgcaa 

tacatcgtgc 

gacccggcaa 

gtggagagga 

cggcaggttg 

[SEQ ID NO. 



atacagacta 
gcgagttcac 
acgactctcc 
gggttgtcag 
agctctactt 
ctgccgttgt 
aaggcttaat 
.cgctctatca 
acgaggaagg 
tttccaccga 
acgtcctcat 
agaagctcgg 
tgggcgatcg 
ttaggagaac 
gacagccgaa 
gattagaaag 
aagagttctt 
tatctcgctc 
ggaatgaact 
nngcgggtgg 
acttccgctc 
acagggaggg 
acttccccgg 
agaagaagat 
gagcaatcaa 
ggtactacaa 
cgataaggga 
ttttcgcaac 
tggactacat 
agcgcggctt 
cgacgcgcgg 
cgagggttct 
aagaggttac 
agcaga-taac 
aacgcctcgc 
tcaaaggctc 
agcacaagta 
ttctgagggc 
gcttgggggc 
121] 



cataactgag 

catagactac 

gattgaggac 

ggccgagaaa 

cactcacGcc 

ggacatctac 

cccgatggag 

cgagggcgag 

ggcgcgcgtt 

gaaggagatg 

aatctacaac 

agtcaagttc 

ctttgcggtg 

gattaacctc 

ggagaaggtc 

ggtggcccgc 

ccctatggaa 

gagtaccgga 

tgcaccaaac 

atacgtcaag 

cctgtatcct 

ttgtgaggag 

cttcatccca 

gaaggccact 

aatccttgct 

ggagtgcgcc 

aatagaggag 

aatacctgga 

caacgccaaa 

cttcgtgacg 

gcttgaaata 

tgaggcgata 

ggagaagctg 

ccgcgacctg 

cgcaaggggg 

g^gaaggatt 

cgatgcagaa 

ctttggttac 

gtggctiaaaa 



gatggaaagc 

gacagaaact 

gtcaagaaga 

gtgaagaaga 

caggacgttc 

gagtacgaca 

ggcgacgagg 

gagttcgccg 

attacctgga 

ataaagcgct 

ggcgacaact 

atcctcggaa 

gaggtcaagg 

cccacttaca 

tacgctgagg 

tactcgatgg 

gcccagctct 

aacctcgtcg 

aagccggacg 

gagcccgaaa 

tcgataataa 

tacgacgtgg 

agcctcctcg 

a-kagacccaa 

aatagcttct 

gagagcgtta 

aaatttggct 

gcggacgccg 

ctgcccggcc 

aagaagaagt 

gttaggcgtg 

ctaaagcacg 

agcaagtacg 

aaggactaca 

ataaaaatcc 

ggggacaggg 

tactacatcg 

cgtaaagaag 

cctaagacat 



ccgtcatcag 

ttgagccata 

taactgccga 

agttcctagg 

ccgcaatcag 

tccccttcgc 

aacttaagat 

aagggcctat 

agaatatcga 

tcctcaaggt 

tcgacttcgc 

gggaagggag 

gaaggattca 

cccttgaggc 

agatagcgca 

aggacgcgaa 

cgcgcctcgt 

agtggttttt 

agagggagct 

ggggactgtg 

tcacccataa 

ctcctcaggt 

gagacctctt 

tcgagaagaa 

acggttacta 

ccggttgggg 

ttaaagtcct 

aaaccgtcaa 

tgctcgaact 

acgcggttat 

actggagcga 

gtgacgttga 

aggttccacc 

aggccaccgg 

ggcccggaac 

ctataccctt 

agaaccaggt 

atttaaggta 

ga 



gatcttcaag 
catctacgcg 
gaggcacggc 
caggccgata 
ggacaagata 
gaagcgctac 
gctcgccttc 
cctgatgata 
ccttccctat 
cgtcaaggaa 
ctacctcaag 
cgaaccgaaa 
cttcgacctc 
agtatatgaa 
ggcctgggaa 
ggtaacctat 
aggccagagc 
gctgaggaag 
ggcaagaaga 
ggagaacatc 
cgtctcccct 
aggccataag 
ggaggagaga 
actcctcgat 
cggctataca 
cagggagtac 
ctacgcggac 
aaagaaggca 
cgaatacgag 
agacgaggag 
gatagcgaag 
agaagcggta 
ggagaagctg 
gccgcatgtg 
ggtcataagc 
tgacgaattt 
tcttccagct 
tcagaaaacg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2322 



Tgo G386S NNN^-TCT,. TCC, TCA, TCG, AGT, AGC 
Tgo 638 6P NNII=<;CT, CCA^ CCG, CCC 

atgatcctcg atacagacta cataactgag gatggaaagc ccgtcatcag gatcttcaag 60 

aaggagaacg gcgagttcac catagactac gacagaaact ttgagccata catctacgcg 120 

ctcttgaagg acgactctcc gattgaggac gtcaagaaga taactgccga gaggcacggc 180 

actaccgtta gggttgtcag ggccgagaaa gtgaagaaga agttcctagg caggccgata 240 

gaggtctgga agctctactt cactcacccc caggacgttc ccgcaatcag ggacaagata 300 

aaggagcatc ctgccgttgt ggacatctac gagtacgaca tccccttcgc gaagcgctac 360 
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ctcatagaca 
gacatcgaga 
agctacgccg 
gtcgacgtcg 
aaggatcccg 
aagcgctccg 
atccagcgca 
taccccgtca 
gccatctttg 
sicgggcgagg 
gaactcggaa 
ctctgggatg 
gcctacgaga 
agggagagct 
gtgtatctgg 
gatacactca 
ttctgcaagg 
cagaaggtaa 
tacaggcaac 
aaggcccgct 
atcgagacca 
acagatggat 
aaggagttcc 
ggcttctaca 
gacaagataa 
gagacgcagg 
aggattgtca 
gtcatctacg 
gctgttgcaa 
tacatcgtgc 
gacccggcaa; 
gtggagagga 
cggcaggttg 
£SEQ ID NO 



aaggcttaat 
cgctctatca 
acgaggaagg 
tttccaccga 
acgtcctcat 
agaagctcgg 
tgggcgatcg 
ttaggagaac 
gacagccgaa 
gattagaaag 
aagagttctt 
tatctcgctc 
ggaatgaact 
acgcgnnngg 
acttccgctc 
acagggaggg 
acttccccgg 
agaagaagat 
gagcaatcaa 
ggtactacaa 
cgataaggga 
ttttcgcaac 
tggactacat 
agcgcggctt 
cgacgcgcgg 
cgagggttct 
aagaggttac 
agcagataac 
aacgcctcgc 
tcaaaggctc 
agcacaagta 
ttctgagggc 
gcttgggggc 
122] 



cccgatggag 
cgagggcgag 
ggcgcgcgtt 
gaaggagatg 
aatctacaac 
agtcaagttc 
ctttgcggtg 
gattaacctc 
ggagaaggtc 
ggtggcccgc 
ccctatggaa 
gagtaccgga 
tgcaccaaac 
atacgtcaag 
cctgtatcct 
ttgtgaggag 
cttcatccca 



gaaggccact 
aatccttgct 
ggagtgcgcc 
aatagaggag 
aatacctgga 
caacgccaaa 
cttcgtgacg 
gcttgaaata 
tgaggcgata 
ggagaagctg 
ccgcgacctg 
cgcaaggggg 
gggaaggatt 
cgatgcagaa 
ctttggttac 
gtggctaaaa 



ggcgacgagg 
gagttcgccg 
attacctgga 
ataaagcgct 
ggcgacaact 
atcctcggaa 
gaggtcaagg 
cccacttaca 
tacgctgagg 
tactcgatgg 
gcccagctct 
aacctcgtcg 
aagccggacg 
gagcccgaaa 
tcgataataa 
tacgacgtgg 
agcctcctcg 
atagacccaa 
aatagcttct 
gagagcgtta 
aaatttggct 
gcggacgccg 
ctgcccggcc 
aagaagaagt 
gttaggcgtg 
ctaaagcacg 
agcaagtacg 
aaggactaca 
ataaaaatcc 
ggggacaggg 
tactacatcg 
cgtaaagaag 
cctaagacat 



aacttaagat 
aagggcctat 
agaatatcga 
tcctcaaggt 
tcgacttcgc 
gggaagggag 
gaaggattca 
cccttgaggc 
agatagcgca 
aggacgcgaa 
cgcgcctcgt 
agtggttttt 
agagggagct 

ggggactgtg 

tcacccataa 
ctcctcaggt 
gagacctctt 
tcgagaagaa 
acggttacta 
ccggttgggg 
ttaaagtcct 
aaaccgtcaa 
tgctcgaact 
acgcggttat 
actggagcga 
gtgacgttga 
aggttccacc 
aggccaccgg 
ggcccggaac 
ctataccctt 
agaaccaggt 
atttaaggta 
ga 



gctcgccttc- 
cctgatgata 
ccttccctat 
cgtcaaggaa 
ctacctcaag 
cgaaccgaaa 
cttcgacctc 
agtatatgaa 
ggcctgggaa 
ggtaacctat 
aggccagagc 
gctgaggaag 
ggcaagaaga 
ggagaacatc 
cgtctcccct 
aggccataag 
ggaggagaga 
actcctcgat 
cggctataca 
cagggagtac 
ctacgcggac 
aaagaaggca 
cgaatacgag 
agacgaggag 
gatagcgaag 
agaagcggta 
ggagaagctg 
gccgcatgtg 
ggtcataagc 
tgacgaattt 
tcttccagct 
tcagaaaacg 



420 

480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2322 



Tgo G387A NNN=GCA, GOT, GCC, GCG 
Tgo G386P NNN=CCT, CCA.^ CCG, CCC 



atgatcctcg 
aaggagaacg 
ctcttgaagg 
actaccgtta 
gaggtctgga 
aaggagcatc 
ctcatagaca 
gacatcgaga 
agctacgccg 
gtcgacgtcg 
aaggatcccg 
aagcgctccg 
atccagcgca 
taccccgtca 



atacagacta 
gcgagttcac 
acgactotcc 
gggttgtcag 
agctctactt 
ctgccgttgt 
aaggcttaat 
cgctctatca 
acgaggaagg 
tttccaccga 
acgtcctcat 
agaagctcgg 
tgggcgatcg 
ttaggagaac 



cataactgag 
catagactac 
gattgaggac 
ggccgagaaa 
cactcacccc 
ggacatctac 
cccgatggag 
cgagggcgag 
ggcgcgcgtt 
gaaggagatg 
aatctacaac 
agtcaagttc 
ctttgcggtg 
gattaacctc 



gatggaaagc 
gacagaaact 
gtcaagaaga 
gtgaagaaga 
caggacgttc 
gagtacgaca 
ggcgacgagg 
gagttcgccg 
attacctgga 
ataaagcgct 
ggcgacaact 
atcctcggaa 
gaggtcaagg 
cccacttaca 



ccgtcatcag 
ttgagccata 
taact^ccga 
agttcctagg 
ccgcaatcag 
tccccttcgc 
aacttaagat 
aagggcctat 
agaatatcga 
tcctcaaggt 
tcgacttcgc 
gggaagggag 
gaaggattca 
cccttgaggc 



gatcttcaag 
catctacgcg 
gaggcacggc 
caggccgata 
ggacaagata 
gaagcgctac 
gctcgccttc 
cctgatgata 
ccttccctat 
cgtcaaggaa 
ctacctcaag 
cgaaccgaaa 
cttcgacctc 
agtatatgaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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gccatctttg gacagccgaa ggagaaggtc 
acgggcgagg gattagaaag ggtggcccgc 
gaactcggaa aagagttctt ccctatggaa 
ctctgggatg tatctcgctc gagtaccgga 
gcctacgaga ggaatgaact tgcaccaaac 
agggagagct acgcgggtnn ntacgtcaag 
gtgtatctgg acttccgctc cctgtatcct 
gatacactca acagggaggg ttgtgaggag 
ttctgcaagg acttccccgg cttcatccca 
cagaaggtaa agaagaagat gaaggccact 
tacaggcaac gagcaatcaa aatccttgct 
aaggcccgct ggtactacaa ggagtgcgcc 
atcgagacca cgataaggga aatagaggag 
acagatggat ttttcgcaac aatacctgga 
aaggagttcc tggactacat caacgccaaa 
ggcttctaca agcgcggctt cttcgtgacg 
gacaagataa cgacgcgcgg gcttgaaata 
gagacgcagg cgagggttct tgaggcgata 
aggattgtca aagaggttac ggagaagctg 
gtcatctacg agcagataac ccgcgacctg 
gctgttgcaa aacgcctcgc cgcaaggggg 
tacatcgtgc tcaaaggctc gggaaggatt 
gacccggcaa agcacaagta cgatgcagaa 
gtggagagga. ttctgagggc ctttggttac 
cggcaggttg gcttgggggc gtggctaaaa 
ISEQ ID NO. 123] 



tacgctgagg agatagcgca ggcctgggaa 
tactcgatgg aggacgcgaa ggtaacctat 
gcccagctct cgcgcctcgt aggccagagc 
aacctcgtcg agtggttttt gctgaggaag 
aagccggacg agagggagct ggcaagaaga 
gagcccgaaa ggggactgtg ggagaacatc 
tcgataataa tcacccataa cgtctcccct 
tacgacgtgg ctcctcaggt aggccataag 
agcctcctcg gagacctctt ggaggagaga 
atagacccaa tcgagaagaa actcctcgat 
aatagcttct acggttacta cggctataca 
gagagcgtta ccggttgggg cagggagtac 
aaatttggct ttaaagtcct ctacgcggac 
gcggacgccg aaaccgtcaa aaagaaggca 
ctgcccggcc tgctcgaact cgaatacgag 
aagaagaagt acgcggttat agacgaggag 
gttaggcgtg actggagcga gatagcgaag 
ctaaagcacg gtgacgttga agaagcggta 
agcaagtacg aggttccacc ggagaagctg 
aaggactaca aggccaccgg gccgcatgtg 
ataaaaatcc ggcccggaac ggtcataagc 
ggggacaggg ctataccctt tgacgaattt 
tactacatcg agaaccaggt tcttccagct 
cgtaaagaag atttaaggta tcagaaaacg 
cctaagacat ga 



Tgo D404E NNN=GAA, GAG 

atgatcctcg atacagacta cataactgag 
aaggagaacg gcgagtteac catagactac 
ctcttgaagg acgactctcc gattgaggac 
actaccgtta gggttgtcag ggccgagaaa 
gaggtctgga agctctactt cactcacccc 
aaggagcatc ctgccgttgt ggacatctac 
ctcatagaca aaggcttaat cccgatggag 
gacatcgaga cgctctatca cgagggcgag 
agctacgccg acgaggaagg ggcgcgcgtt 
gtcgacgtcg tttccaccga gaaggagatg 
aaggatcccg acgtcctcat aatctacaac 
aagcgctccg agaagctcgg agtcaagttc 
atccagcgca tgggcgatcg ctttgcggtg 
taccccgtca ttaggagaac gattaacctc 
gccatctttg gacagccgaa ggagaaggtc 
acgggcgagg gattagaaag ggtggcccgc 
gaactcggaa aagagttctt ccctatggaa 
ctctgggatg tatctcgctc gagtaccgga 
gcctacgaga ggaatgaact tgcaccaaac 
agggagagct acgcgggtgg atacgtcaag 
gtgtatctgn nnttccgctc cctgtatcct 
gatacactca acagggaggg ttgtgaggag 
ttctgcaagg acttccccgg cttcatccca 



gatggaaagc ccgtcatcag gatcttcaag 
gacagaaact ttgagccata catctacgcg 
gtcaagaaga taactgccga gaggcacggc 
gtgaagaaga agttcctagg caggccgata 
caggacgttc ccgcaatcag ggacaagata 
gagtacgaca tccccttcgc gaagcgctac 
ggcgacgagg aacttaagat gctcgccttc 
gagttcgccg aagggcctat cctgatgata 
attacctgga agaatatcga ccttccctat 
ataaagcgct tcctcaaggt cgtcaaggaa 
ggcgacaact tcgacttcgc ctacctcaag 
atcctcggaa gggaagggag cgaaccgaaa 
gaggtcaagg gaaggattca cttcgacctc 
cccacttaca cccttgaggc agtatatgaa 
tacgctgagg agatagcgca ggcctgggaa 
tactcgatgg aggacgcgaa ggtaacctat 
gcccagctct cgcgcctcgt aggccagagc 
aacctcgtcg agtggttttt gctgaggaag 
aagccggacg agagggagct ggcaagaaga 
gagcccgaaa ggggactgtg ggagaacatc 
tcgataataa tcacccataa cgtctcccct 
tacgacgtgg ctcctcaggt aggccataag 
agcctcctcg gagacctctt ggaggagaga 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2322 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 . 

1140 

1200 

1260 

1320 

1380 
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cagaaggtaa 

tacaggcaac 

aaggcccgct 

atcgagacca 

acagatggat 

aaggagttcc 

ggcttctaca 

gacaagataa 

gagacgcagg 

aggattgtca 

gtcatctacg 

gctgttgcaa 

tacatcgtgc 

gacccggcaa 

gtggagagga 

cggcaggttg 

[SEQ ID NO. 



agaagaagat 
gagcaatcaa 
ggtactacaa 
cgataaggga 
ttttcgcaac 
tggactacat 
agcgcggctt 
cgacgcgcgg 
cgagggttct 
aagaggttac 
agcagataac 
aacgcctcgc 
tcaaaggctc 
agcacaagta 
ttctgagggc 
gcttgggggc 
124] 



gaaggccact 

aatccttgct 

ggagtgcgcc 

aatagaggag 

aatacctgga 

caacgccaaa 

cttcgrtgacg 

gcttgaaata 

tgaggcgata 

ggagaagctg 

ccgcgacctg 

cgcaaggggg 

gggaaggatt 

cgatgcagaa 

•ctttggttac 

gtggctaaaa 



atagacccaa 

aatagcttct 

gagagcgtta 

aaatttggct 

gcggacgccg 

ctgcccggcc 

aagaagaagt 

gttaggcgtg 

ctaaagcacg 

agcaagtacg 

aaggactaca 

ataaaaatcc 

ggggacaggg 

tactacatcg 

cgtaaagaag 

cctaagacat 



tcgagaagaa 

acggttacta 

ccggttgggg 

ttaaagtcct 

aaaccgtcaa 

tgctcgaact 

acgcggttat 

actggagcga 

gtgacgttga 

aggttccacc 

aggccaccgg 

ggcccggaac 

ctataccctt 

agaaccaggt 

atttaaggta 

ga 



actcctcgat 

cggctataca 

cagggagtac 

ctacgcggac 

aaagaaggca 

cgaatacgag 

agacgaggag 

gatagcgaag 

agaagcggta 

ggagaagctg 

gccgcatgtg 

ggrtcataagc 

tgacgaattt 

tcttccagct 

tcagaaaacg 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2322 



Tgo T541P NNN=CCT, CCA, CCG, CCC 



atgatcctcg 
aaggagaacg 
ctcttgaagg 
actaccgtta 
gaggtctgga 
aaggagcatc 
ctcatagaca 
gacatcgaga 
agctacgccg 
gtcgacgtcg 
aaggatcccg 
aagcgctccg 
atccagcgca 
taccccgtca 
gccatctttg 

acgggcgagg 

gaactcggaa 
ctctgggatg 
gcctacgaga 
agggagagct 
gtgtatctgg 
gatacactca 
ttctgcaagg 
cagaaggtaa 
tacaggcaac 
aaggcccgct 
atcgagacca 
nnngatggat 
aaggagttcc 
ggcttctaca 
gacaagataa 
gagacgcagg 



atacagacta 
gcgagttcac 
acgactctcc 
gggttgtcag 
agctctactt 
ctgccgttgt 
aaggcttaat 
cgctctatca 
acgaggaagg 
tttccaccga 
acgtcctcat 
agaagctcgg 
tgggcgatcg 
ttaggagaac 
gacagccgaa 
gattagaaag 
aagagttctt 
tatctcgctc 
ggaatgaact 
acgcgggtgg 
acttccgctc 
acagggaggg 
acttccccgg 
agaagaagat 
gagcaatcaa 
ggtactacaa 
cgataaggga 
ttttcgcaac 
tggactacat 
agcgcggctt 
cgacgcgcgg 
cgagggttct 



cataactgag 
catagactac 
gattgaggac 
ggccgagaaa 
cactcacccc 
ggacatctac 
cccgatggag 
cgagggcgag 
ggcgcgcgtt 
gaaggagatg 
aatctacaac 
agtcaagttc 
ctttgcggtg 
gattaacctc 
ggagaaggtc 
ggtggcccgc 
ccctatggaa 
gagtaccgga 
tgcaccaaac 
atacgtcaag 
cctgtatcct 
ttgtgaggag 
cttcatccca 
gaaggccact 
aatccttgct 
ggagtgcgcc 
aatagaggag 
aatacctgga 
caacgccaaa 
cttcgtgacg 
gcttgaaata 
tgaggcgata 



gatggaaagc 
gacagaaact 
gtcaagaaga 
gtgaagaaga 
caggacgttc 
gagtacgaca 
ggcgacgagg 
gagttcgccg 
attacctgga 
ataaagcgct 
ggcgacaact 
atcctcggaa 
gaggtcaagg 
cccactt'aca 
tacgctgagg 
tactcgatgg 
gcccagctct 
aacctcgtcg 
aagccggacg 
gagcccgaaa 
tcgataataa 
tacgacgtgg 
agcctcctcg 
atagacccaa 
aatagcttct 
gagagcgtta 
aaatttggct 
gcggacgccg 
ctgcccggcc 
aagaagaagt 
gttaggcgtg 
ctaaagcacg 



ccgtcatcag 
ttgagccata 
taactgccga 
agttcctagg 
ccgcaatcag 
tccccttcgc 
aacttaagat 
aagggcctat 
agaatatcga 
tcctcaaggft 
tcgacttcgc 
gggaagggag 
gaaggattca 
cccttgaggc 
agatagcgca 
aggacgcgaa 
cgcgcctcgt 
agtggttttt 
agagggagct 
ggggactgtg 
tcacccataa 
ctcctcaggt 
gagacctctt 
tcgagaagaa 
acggttacta 
ccggttgggg 
ttaaagtcct 
aaaccgtcaa 
tgctcgaact 
acgcggttat 
actggagcga 
gtgacgttga 



gatcttcaag 
catctacgcg 
gaggcacggc 
caggccgata 
ggacaagata 
gaagcgctac 
gctcgccttc 
cctgatgata 
ccttccctat 
cgtcaaggaa 
ctacctcaag 
cgaaccgaaa 
cttcgacctc 
agtatatgaa 
ggcctgggaa 
ggtaacctat 
aggccagagc 
gctgaggaag 
ggcaagaaga 
ggagaacatc 
cgtctcccct 
aggccataag 
ggaggagaga 
actcctcgat 
cggctataca 
cagggagtac 
ctacgcggac 
aaagaaggca 
cgaatacgag 
agacgaggag 
gatagcgaag 
agaagcggta 
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aggattgtca 
gtcatctacg 
gctgttgcaa 
tacatcgtgc 
gacccggcaa 
gtggagagga 
cggcaggttg 
[SEQ ID NO. 



aagaggttac 
agcagataac 
aacgcctcgc 
tcaaaggctc 
agcacaagta 
ttctgagggc 
gcttgggggc 
125] 



ggagaagctg 
ccgcgacctg 
cgcaaggggg 
gggaaggatt 
cgatgcagaa 
ctttggttac 
gtggctaaaa 



agcaagtacg 
aaggactaca 
ataaaaatcc 
ggggacaggg 
tactacatcg 
cgtaaagaag 
cctaagacat 



aggttccacc 
aggccaccgg 
ggcccggaac 
ctataccctt 
agaaccaggt 
atttaaggta 
ga 



ggagaagctg 
gccgcatgtg 
ggtcataagc 
tgacgaattt 
tcttccagct 
tcagapiaacg 



1980 
2040 
2100 
2160 
2220 
2280 
2322 



Tgo D5426 NNN=GGT, GGA, GGG, GGC 



atgatcctcg 

aaggagaacg 

ctcttgaagg 

actaccgtta 

gaggtctgga 

aaggagcatc 

ctcatagaca 

gacatcgaga 

agctacgccg 

gtcgaccftcg 

aaggatcccg 

aagcgctccg 

atccagcgca 

taccccgtca 

gccatctttg 

acgggcgagg 

gaactcggaa 

ctctgggatg 

gcctacgaga 

agggagagct 

gtgtatctgg 

gatacactca 

ttctgcaagg 

cagaaggbaa 

tacaggcaac 

aaggcccgct 

atcgagacca 

acannnggat 

aaggagttcc 

ggcttctaca 

gacaagataa 

gagacgcagg 

aggattgtca 

gtcatctacg 

gctgttgcaa 

tacatcgtgc 

gacccggcaa 

gtggagagga 

cggfcaggttg 

I SEQ ID NO 



atacagacta 
gcgagttcac 
acgactctcc 
gggttgtcag 
agctctactt 
ctgccgttgt 
aaggcttaat 
cgctctatca 
acgaggaagg 
tttccaccga 
acgtcctcat 
agaagctcgg 
tgggcgatcg 
ttaggagaac 
gacagccgaa 
gattagaaag 
aagagttctt 
tatctcgctc 
ggaatgaact 
acgcgggtgg 
acttccgctc 
acagggaggg 
acttccccgg 
agaagaagat. 
gagcaatcaa 
ggtactacaa 
cgataaggga 
ttttcgcaac 
tggactacat 
agcgcggctt 
cgacgcgcgg 
cgagggttct 
aagaggttac 
agcagataac 
aacgcctcgc 
tcaaaggctc 
agcacaagta 
ttctgagggc 
gcttgggggc 
126] 



cataactgag 
catagactac 
gattgaggac 
ggccgagaaa 
cactcacccc 
ggacatctac 
cccgatggag 
cgagggcgag 
ggcgcgcgtt 
gaaggagatg 
aatctacaac 
agtcaagttc 
ctttgcggtg 
gattaacctc 
ggagaaggtc 
g'gtggcccgc 
ccctatggaa 
gagtaccgga 
tgcaccaaac 
atacgtcaag 
cctgtatcct 
ttgtgaggag 
cttcatccca 
gaaggccact 
aatccttgct 
ggagtgcgcc 
aatagaggag 
aatacctgga 
caacgccaaa 
cttcgtgacg 
gcttgaaata 
tgaggcgata 
ggagaagctg 
ccgcgacctg 
cgcaaggggg 
gggaaggatt 
cgatgcagaa 
ctttggttac 
gtggctaaaa 



gatggaaagc 
gacagaaact 
gtcaagaaga 
gtgaagaagd 
caggacgttc 
gagtacgaca 
ggcgacgagg 
gagttcgccg 
attacctgga 
ataaagcgct 
ggcgacaact 
atcctcggaa 
gaggtcaagg 
cccacttaca 
tacgctgagg 
tactcgatgg 
gcccagctct 
aacctcgtcg 
aagccggacg 
gagcccgaaa 
tcgataataa 
tacgacgtgg 
agcctcctcg 
atagacccaa 
aatagcttct 
gagagcgtta 
aaatttggct 
gcggacgccg 
ctgcccggcc 
aagaagaagt 
gttaggcgtg 
ctaaagcacg 
agcaagtacg 
aaggactaca 
ataaaaatcc 
ggggacaggg 
tactacatcg 
cgrtaaagaag 
cctaagacat 



ccgtcatcag 
ttgagccata 
taactgccga 
agttcctagg 
ccgcaatcag 
tccccttcgc 
aacttaagat 

^sgggcctat 

agaatatcga 
tcctcaaggt 
tcgacttcgc 
gggaagggag 
gaaggattca 
cccttgaggc 
agatagcgca 
aggacgogaa 
cgcgcctcgt 
agtggttttt 
agagggagct 
ggggactgtg 
tcacccataa 
ctcctcaggt 
gagacctctt 
tcgagaagaa 
acggttacta 
ccggttgggg 
ttaaagtcct 
aaaccgtcaa 
tgctcgaact 
acgcggttat 
actggagcga 
gtgacgttga 
aggttccacc 
aggccaccgg 
ggcccggaac 
ctataccctt 
agaaccaggt 
atttaaggta 
ga 



gatcttcaag 
catctacgcg 
gaggcacggc 
caggccgata 
ggacaagata 
gaagcgctac 
gctcgccttc 
cctgatgata 
ccttccctat 
cgtcaaggaa 
ctacctcaag 
cgaaccgaaa 
cttcgacctc 
agtatatgaa 
ggcctgggaa 
ggtaacctat 
aggccagagc 
gctgaggaag 
ggcaagaaga 
ggagaacatc 
cgtct cccct 
aggccataag 
ggaggagaga 
actcctcgat 
cggctataca 
cagggagtac 
ctacgcggac 
aaagaaggca 
cgaatacgag 
agacgaggag 
gatagcgaag 
agaagcggta 
ggagaagctg 
gccgcatgtg 
ggtcataagc 
tgacgaattt 
tcttccagct 
tcagaaaacg 
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Tgo K592T NNN=ACT, ACC, ACA, ACG 



atgatcctcg 
aaggagaacg 
ctcttgaagg 
actaccgtta 
gaggtctgga 
aaggagcatc 
ctcatagaca 
gacatcgaga 
agctacgccg 
gtcgacgtcg 
aaggatcccg 
aagcgctccg 
atccagcgca 
taccccgtca 
gccatctttg 
acgggcgagg 
gaactcggaa 
ctctgggatg 
gcctacgaga 
agggagagct 
gtgtatctgg 
gatacactca 
ttctgcaagg 
cagaaggtaa 
tacaggcaac 
aaggcccgct 
a-tcgagacca 
acagatggat 
aaggagttcc 
ggcttctaca 
gacaagataa 
gagacgcagg 
aggattgtca 
gtcatctacg 
gctgttgcaa 
tacatcgtgc 
gacccggcaa 
gtggagagga 
cggcaggttg 
[SEQ ID NO, 



atacagacta 
gcgagttcac 
acgactctcc 
gggttgtcag 
agctctactt 
ctgccgttgt 
aaggcttaat 
cgctctatca 
acgaggaagg 
tttccaccga 
acgtcctcat 
agaagctcgg 
tgggcgatcg 
ttaggagaac 
gacagccgaa 
gattagaaag 
aagagttctt 
tatctcgctc 
ggaatgaact 
acgcgggtgg 
acttccgctc 
"acagggaggg 
acttccccgg 
agaagaagat 
gagcaatcaa 
ggtactacaa 
cgataaggga 
ttttcgcaac 
tggactacat 
agcgcggctt 
cgacgcgcgg 
cgagggttct 
aagaggttac 
agcagataac 
aacgcctcgc 
tcaaaggctc 
agcacaagrta 
ttctgagggc 
gcttgggggc 
127] 



cataactgag 
catagactac 
gattgaggac 
ggccgagaaa 
cactcacccc 
ggacatctac 
cccgatggag 
cgagggcgag 
ggcgcgcgtt 
gaaggagatg 
aatctacaac 



gatggaaagc 
gacagaaact 
gtcaagaaga 
gtgaagaaga 
caggacgttc 
gagtacgaca 



agtcaagttc 
ctttgcggtg 
gattaacctc 
ggagaaggtc 
ggtggcccgc 
ccctatggaa 
gagtaccgga 
tgcaccaaac 
atacgtcaag 
cctgtatcct 
ttgtgaggag 
cttcatccca 
gaaggccact 
aatccttgct 
ggagtgcgcc 
aatagaggag 
aatacctgga 
caacgccaaa 
cttcgtgacg 
gcttgaaata 
tgaggcgata 
ggagaagctg 
ccgcgacctg 
cgcaaggggg 
gggaaggatt 
cgatgcagaa 
ctttggttac 
gtggctaaaa 



ggcgacgagg 
gagttcgccg 
attacctgga 
ataaagcgct 
ggcgacaact 
atcctcggaa 
gaggtcaagg 
cccacttaca 
tacgctgagg 
tactcgatgg 
gcccagctct 
aacctcgtcg 
aagccggacg 
gagcccgaaa 
tcgataataa 
tacgacgtgg 
agcctcctcg 
atagacccaa 
aatagcttct 
gagagcgtta 
aaatttggct 
gcggacgccg 
ctgcccggcc 
aagnnnaagt 
gttaggcgtg 
ctaaagcacg 
agcaagtacg 
aaggactaca 
ataaaaatcc 
ggggacaggg 
tactacatcg 
cgrtaaagaag 
cctaagacat 



ccgtcatcag 
ttgagccata 
taactgccga 
agttcctagg 
ccgcaatcag 
tccccttcgc 
aacttaagat 
aagggcctat 
agaatatcga 
tcctcaaggt 
tcgacttcgc 
gggaagggag 
gaaggattca 
cccttgaggc 
agatagcgca 
aggacgcgaa 
cgcgcctcgt 
agtggttttt 
agagggagct 
ggggactgtg 
tcacccataa 
ctcctcaggt 
gagacctctt 
tcgagaagaa 
acggttacta 
ccggttgggg 
ttaaagtcct 
aaaccgtcaa 
tgctcgaact 
acgcggttat 
actggagcga 
gtgacgttga 
aggttccacc 
aggccaccgg 
ggcccggaac 
ctataccctt 
agaaccaggt 
atttaaggta 
ga 



gatcttcaag 
catctacgcg 
gaggcacggc 
caggccgata 
ggacaagata 
gaagcgctac 
gctcgccttc 
cctgatgata 
ccttccctat 
cgtcaaggaa 
ctacctcaag 
cgaaccgaaa 
cttcgacctc 
agtatatgaa 
ggcctgggaa 
ggtaacctat 
aggccagagc 
gctgaggaag • 
ggcaagaaga 
ggagaacatc 
cgtctcccct 
aggccataag 
ggaggagaga 
actcctcgat 
cggctataca 
cagggagtac 
ctacgcggac 
aaagaaggca 
cgaatacgag 
agacgaggag 
gatagcgaag 
agaagcggta 
ggagaagctg 
gccgcatgtg 
ggtcataagc 
tgacgaattt 
tcttccagct 
ticagaaaacg 
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